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Preface 



Linear algebra has in recent years become an essential part of the mathematical 
background required of mathematicians, engineers, physicists and other scientists. 
This requirement reflects the importance and wide applications of the subject matter. 

This book is designed for use as a textbook for a formal course in linear algebra 
or as a supplement to all current standard texts. It aims to present an introduction to 
linear algebra which will be found helpful to all readers regardless of their fields of 
specialization. More material has been included than can be covered in most first 
courses. This has been done to make the book more flexible, to provide a useful book 
of reference, and to stimulate further interest in the subject. 

Each chapter begins with clear statements of pertinent definitions, principles and 
theorems together with illustrative and other descriptive material. This is followed 
by graded sets of solved and supplementary problems. The solved problems serve to 
illustrate and amplify the theory, bring into sharp focus those fine points without 
which the student continually feels himself on unsafe ground, and provide the repetition 
of basic principles so vital to effective learning. Numerous proofs of theorems are 
included among the solved problems. The supplementary problems serve as a complete 
review of the material of each chapter. 

The first three chapters treat of vectors in Euclidean space, linear equations and 
matrices. These provide the motivation and basic computational tools for the abstract 
treatment of vector spaces and linear mappings which follow. A chapter on eigen- 
values and eigenvectors, preceded by determinants, gives conditions for representing 
a linear operator by a diagonal matrix. This naturally leads to the study of various 
canonical forms, specifically the triangular, Jordan and rational canonical forms. 
In the last chapter, on inner product spaces, the spectral theorem for symmetric op- 
erators is obtained and is applied to the diagonalization of real quadratic forms. For 
completeness, the appendices include sections on sets and relations, algebraic structures 
and polynomials over a field. 

I wish to thank many friends and colleagues, especially Dr. Martin Silverstein and 
Dr. Hwa Tsang, for invaluable suggestions and critical review of the manuscript. 
I also want to express my gratitude to Daniel Schaum and Nicola Monti for their very 
helpful cooperation. 

Seymour Lipschutz 

Temple University 
January, 1968 
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Chapter 1 



Vectors in R n and C 



INTRODUCTION 

In various physical applications there appear certain quantities, such as temperature 
and speed, which possess only "magnitude". These can be represented by real numbers and 
are called scalars. On the other hand, there are also quantities, such as force and velocity, 
which possess both "magnitude" and "direction". These quantities can be represented by 
arrows (having appropriate lengths and directions and emanating from some given ref- 
erence point O) and are called vectors. In this chapter we study the properties of such 
vectors in some detail. 

We begin by considering the following operations on vectors. 

(i) Addition: The resultant u + v of two vectors u 
and v is obtained by the so-called parallelogram 
law, i.e. u + v is the diagonal of the parallelogram 
formed by u and v as shown on the right. 

(ii) Scalar multiplication: The product feu of a real 
number ft by a vector u is obtained by multiplying 
the magnitude of u by ft and retaining the same 
direction if fc^O or the opposite direction if 
ft < 0, as shown on the right. 

Now we assume the reader is familiar with the representation of the points in the plane 
by ordered pairs of real numbers. If the origin of the axes is chosen at the reference point 
0 above, then every vector is uniquely determined by the coordinates of its endpoint. The 
relationship between the above operations and endpoints follows. 

(i) Addition: If (a, b) and (c, d) are the endpoints of the vectors u and v, then (a + c, b + d) 
will be the endpoint of u + v, as shown in Fig. (a) below. 




{a + c, b + d) 





(ka, kb) 



Fig. (a) 



Fig. (6) 



(ii) Scalar multiplication: If (a, b) is the endpoint of the vector u, then (ka, kb) will be the 
endpoint of the vector fcu, as shown in Fig. (6) above. 
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Mathematically, we identify a vector with its endpoint; that is, we call the ordered pair 
(a, 6) of real numbers a vector. In fact, we shall generalize this notion and call an «-tuple 
(ai,a 2 , ...,On) of real numbers a vector. We shall again generalize and permit the co- 
ordinates of the ra-tuple to be complex numbers and not just real numbers. Furthermore, in 
Chapter 4, we shall abstract properties of these %-tuples and formally define the mathe- 
matical system called a vector space. 

We assume the reader is familiar with the elementary properties of the real number 
field which we denote by R. 



VECTORS IN R» 

The set of all %-tuples of real numbers, denoted by R", is called n-space. A particular 
ra-tuple in R", say 

U — (Ml, U%, . ■ ■ , Un) 

is called a point or vector; the real numbers im are called the components (or: coordinates) 
of the vector u. Moreover, when discussing the space R" we use the term scalar for the 
elements of R, i.e. for the real numbers. 

Example 1.1: Consider the following vectors: 

(0,1), (1,-3), (1,2, VS. 4), (-5,-i,0, ff ) 

The first two vectors have two components and so are points in R 2 ; the last two 
vectors have four components and so are points in R 4 . 

Two vectors u and v are equal, written u = v, if they have the same number of com- 
ponents, i.e. belong to the same space, and if corresponding components are equal. The 
vectors (1, 2, 3) and (2, 3, 1) are not equal, since corresponding elements are not equal. 

Example 1.2: Suppose {x - y, x + y, z - 1) = (4, 2, 3). Then, by definition of equality of vectors, 

x — y = 4 
x + y = 2 
z - 1 = 3 

Solving the above system of equations gives x = 3, y = — 1, and z — 4. 

VECTOR ADDITION AND SCALAR MULTIPLICATION 

Let u and v be vectors in R n : 

u = (Mi, u 2 , . . . , Un) and v = {vi, v z , . . . , v n ) 
The sum of u and v, written u + v, is the vector obtained by adding corresponding components: 

U + V = (Ui + Vl, U2 + V2, . ■ ■ , Un + Vn) 

The product of a real number k by the vector u, written ku, is the vector obtained by multi- 
plying each component of u by k: 

ku = (kui, ku2, . . . , kun) 
Observe that u + v and ku are also vectors in R". We also define 

-u = -lu and u — v = u + (-v) 
The sum of vectors with different numbers of components is not defined. 
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Example 1.3: Let u = (1,-3,2,4) and v = (3,5,-1,-2). Then 

u + v = (1 + 3,-3 + 5,2-1,4-2) = (4,2,1,2) 

5m = (5-1, 5« (-3), 5 • 2, 5« 4) = (5,-15,10,20) 

2u - 3v = (2, -6, 4, 8) + (-9, -15, 3, 6) = (-7, -21, 7, 14) 

Example 1.4: The vector (0, 0, . . ., 0) in P.", denoted by 0, is called the zero vector. It is similar 
to the scalar 0 in that, for any vector u = (wj, it 2 , . . . , u n ), 

U + 0 = (% + 0, tt 2 + 0, . . . , U n + 0) = 2*2 «*n) = « 

Basic properties of the vectors in R" under the operations of vector addition and scalar 
multiplication are described in the following theorem. 

Theorem 1.1: For any vectors u,v,w G R n and any scalars k, k' G R: 

(i) (u + v) + w = u + (v + w) (v) k(u + v) = ku + kv 

(ii) u + 0 = u (vi) (k + k')u = ku + k'u 

(iii) u + (— w) = 0 (vii) (kk')u = k(k'u) 

(iv) u + v — v + u 1«* = « 

Remark: Suppose u and i; are vectors in R n for which u-kv for some nonzero scalar 
k G R. Then w is said to be in the same direction as v if k > 0, and in the op- 
posite direction if & < 0. 

DOT PRODUCT 

Let « and v be vectors in R": 

u = (ui, uz, . . . , Un) and v = (vi, Vi, . . ., v n ) 

The dot or inner product of u and v, denoted by wv, is the scalar obtained by multiplying 
corresponding components and adding the resulting products: 

WV = UiVi + U2V2 + • • • + U„Vn 

The vectors u and v are said to be orthogonal (or: perpendicular) if their dot product is 
zero: u • v = 0. 

Example 15: Let w = (1, -2, 3, -4), v = (6, 7, 1, -2) and w = (5, -4, 5, 7). Then 

= 1«6 + (-2)-7 + 3-1 + (-4)«(-2) = 6-14 + 3 + 8 = 3 
ww = 1*6 + (-2) '(-4) + 3-6 + (-4) «7 = 5 + 8 + 15-28 = 0 
Thus u and w are orthogonal. 
Basic properties of the dot product in R" follow. 
Theorem 1.2: For any vectors u,v,w G R" and any scalar k G R: 

(i) {u + v) ' w — u • w + v ' w (iii) u • v = v • u 

(ii) {ku)-v = k{wv) (iv) u-u^0, and m-m = 0 iff m = 0 

Remark: The space R" with the above operations of vector addition, scalar multiplication 
and dot product is usually called Euclidean n-space. 

NORM AND DISTANCE IN R" 

Let u and v be vectors in R": u = {u u U2, ...,Un) and v = (vi,V2, . . .,v n ). The dis- 
tance between the points u and v, written d{u,v), is defined by 

d(u,v) = y/{ui - Vi) 2 + (U2 - V2) 2 + •■■ +(un- v n y 
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The norm (or: length) of the vector u, written [|«||, is denned to be the nonnegative square 

root of ww. . 

= y/wu = yu\ + u\ + • • • + u\ 

By Theorem 1.2, wu^O and so the square root exists. Observe that 

d(u, v) — \\u — v\\ 

Example 1.6: Let u = (1,-2, 4,1) and v = (3, 1, -5, 0). Then 

d(u,v) = V(l - 3) 2 + (-2 - D 2 + (4 + 5) 2 + (1 - 0) 2 = V95 
\\v\\ = v^ 2 + l 2 + (-5)2 + 02 = V35 

Now if we consider two points, say p - (a, b) and q = (c, d) in the plane R 2 , then 

||p|| = V" 2 + b 2 and d(p,q) = V(a - e)» + (6 - d)« 

That is, ||p|| corresponds to the usual Euclidean length of the arrow from the origin to the 
point v, and d(p, q) corresponds to the usual Euclidean distance between the points p and 
q, as shown below: 



P = (a, b) 




1~ 
\b-d\ 

_L_ 




-^Ti> = (a, 6) 



I q = (c, d) 

I 



|- la - c| — ►) 



A similar result holds for points on the line R and in space R 3 . 

Remark: A vector e is called a unit vector if its norm is 1: ||e|| = 1. Observe that, for 
any nonzero vector «£R", the vector e« = u/\\u\\ is a unit vector in the same 
direction as u. 

We now state a fundamental relationship known as the Cauchy-Schwarz inequality. 
Theorem 1.3 (Cauchy-Schwarz): For any vectors u,v G R", \u-v\^ \\u\\ \\v\\. 

Using the above inequality, we can now define the angle 9 between any two nonzero 



vectors «,«eR" by 



Note that if u • v = 0, then 9 
definition of orthogonality. 



cos 6 = 



W V 

INI IM 



90° (or: 0 = nil). This then agrees with our previous 



COMPLEX NUMBERS 

The set of complex numbers is denoted by C. Formally, a complex number is an 
ordered pair (a, b) of real numbers; equality, addition and multiplication of complex num- 
bers are defined as follows: 

(a, b) = (c, d) iff a = c and b = d 
(a,b) + (c,d) = {a + c,b + d) 
(a,b)(c,d) = (ae-bd,ad + bc) 
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We identify the real number a with the complex number (a, 0): 

a <r± (a, 0) 

This is possible since the operations of addition and multiplication of real numbers are 
preserved under the correspondence: 

(a, 0) + (b, 0) = (a + b, 0) and (a, 0)(b, 0) = (ab, 0) 
Thus we view E as a subset of C and replace (a, 0) by a whenever convenient and possible. 
The complex number (0, 1), denoted by i, has the important property that 
i 2 = ii = (0, 1)(0, 1) = (-1, 0) = -1 or i = 
Furthermore, using the fact 

(a, 6) = (a, 0) + (0, 6) and (0, 6) = (b, 0)(0, 1) 

we have (a, b) = (a, 0) + (b, 0)(0, 1) = a + bi 

The notation a + bi is more convenient than (a, b). For example, the sum and product of 
complex numbers can be obtained by simply using the commutative and distributive laws 
and i 2 = — 1* 

(a + bi) + (c + di) = a + c + bi + di = (a + c) + (b + d)i 
(a + bi)(c + di) = ac + bci + adi + bdi 2 - (ac - bd) + (be + ad)i 
The conjugate of the complex number z = (a,b) = a + bi is denoted and denned by 

z = a — bi 

(Notice that zz = a 2 + b 2 .) If, in addition, z 0, then the inverse « _1 of z and division by 
z are given by 

* _1 = Tz = ^Tb^ + ^FTb 2i and 7 = wz ~ l 

where w6C. We also define 

-z - -lz and w - z - w + (-z) 

Example 1.7: Suppose z = 2 + Si and w = 5-2i. Then 

z + w = (2 + 3*0 + (5 - 2i) = 2 + 5 + 3i - 2i = 7 + * 
w = (2 + 3i)(5 - 2i) = 10 + 15i - 4t - 6»* = 16 + Hi 
2 = 2 + 3i = 2 - Zi and w = 5 - 2i = 5 + 2i 
5-2^ _ (5 - 2i)(2 - 3i) _ 4 - 19i 



z 



2 + 3i (2 + 3t)(2 - Si) 



13 



4_ 
13 



19 . 
IT 



Just as the real numbers can be represented by the 
points on a line, the complex numbers can be represented 
by the points in the plane. Specifically, we let the point 
(a, b) in the plane represent the complex number z = a + bi, 
i.e. whose real part is a and whose imaginary part is b. The 
absolute value of z, written |z|, is defined as the distance 
from z to the origin: 

|*| = Vo" + b 2 

Note that \z\ is equal to the norm of the vector (a, b). Also, |z| 
Example 1.8: Suppose z 




2 + 3i and w - 12 - 5i. Then 
= VT+~9 = V^3 and \w\ = a/144 + 25 = 13 
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Remark: In Appendix B we define the algebraic structure called a field. We emphasize 
that the set C of complex numbers with the above operations of addition and 
multiplication is a field. 

VECTORS IN C» 

The set of all n-tuples of complex numbers, denoted by C", is called complex n-space. 
Just as in the real case, the elements of C" are called points or vectors, the elements of C 
are called scalars, and vector addition in C" and scalar multiplication on C" are given by 

(Zl, Z 2 , . . . , Zn) + (Wl, W 2 , . . . , »n) = (Zi + Wl, Z2 + W2, . . . , Z„ + W n ) 
z(Zi, Z%, . . . , Z n ) = (ZZi, ZZ 2 , . . . , ZZ n ) 

where z it w if z G C. 

Example 1.9 : (2 + 3i, 4 - i, 3) + (3 - 2i, 5i, 4 - 6i) = (5 + i, 4 + 4i, 7 - 6i) 
2i(2 + 3i, 4 - i, 3) = (-6 + 4i, 2 + 8i, 6t) 

Now let u and v be arbitrary vectors in C": 

U = (Zi, Z 2 , .... Zn), V = (Wl, W2, . . ., W n ), Zi, Wi G C 

The dot, or inner, product of u and v is defined as follows: 

U • V = ZilVi + Z2W2 + ■ • ■ + ZnWn 

Note that this definition reduces to the previous one in the real case, since Wi = Wi when 
Wi is real. The norm of u is defined by 

\\ U \\ = yftFu = +Z2Z2 + ••• + ZnZn = \/>l| 2 + + T77 + |«»| 2 

Observe that wu and so ||tt|| are real and positive when u ¥° 0, and 0 when u = 0. 

Example 1.10: Let u = (2 + Zi, 4 - i, 2i) and v = (3 - 2i, 5, 4- 6i). Then 

u * v = (2 + 3i)(F r 2l) + (4 - i)(B) + (2i)(4 - 6i) 
= (2 + 3i)(3 + 2i) + (4 - i)(5) + (2i)(4 + 6i) 
= 13i + 20 - hi - 12 + 8i = 8 + 16i 

m • u = (2 + 3i)(2 + 3i) + (4 - i)(4-i) + (2i)(2l) 
= (2 + 3i)(2 - 3i) + (4 - i)(4 + i) + (2i)(-2i) 
= 13 + 17 + 4 = 34 

- Vwu = \/34 

The space C" with the above operations of vector addition, scalar multiplication and dot 
product, is called complex Euclidean n-space. 

Remark: If wv were defined by u-v = Z1W1 + ••• + z n w n , then it is possible for 
wu-0 even though u¥>0, e.g. if »=(l,t,0). In fact, wu may not even 
be real. 
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Solved Problems 

VECTORS IN R" 

1.1. Compute: (i) (3,-4,5) + (1,1,-2); (ii) (1,2,-3) + (4,-5); (iii) -3(4,-5,-6); 
(iv) -(-6,7,-8). 

(i) Add corresponding components: (3,-4,5) + (1,1,-2) = (3 + 1,-4 + 1,5-2) = (4,-3,3). 

(ii) The sum is not defined since the vectors have different numbers of components. 

(iii) Multiply each component by the scalar: —3(4, —5, —6) = (—12, 15, 18). 

(iv) Multiply each component by -1: -(—6, 7, —8) = (6, -7, 8). 



1.2. Let u — (2, -7, 1), v = (-3, 0, 4), w = (0, 5, -8). Find (i) Su - 4v, (ii) 2u + Zv - 5w. 
First perform the scalar multiplication and then the vector addition. 

(i) 3u-4v = 3(2, -7, 1) - 4(-3, 0, 4) = (6, -21, 3) + (12, 0, -16) = (18, -21, -13) 

(ii) 2u + 3v - 5w = 2(2, -7, 1) + 3(-3, 0, 4) - 5(0, 5, -8) 

= (4, -14, 2) + (-9, 0, 12) + (0, -25, 40) 

= (4-9 + 0,-14 + 0-25,2 + 12 + 40) = (-5,-39,54) 



1.3. Find x and y if (x, 3) = (2, x + y). 

Since the two vectors are equal, the corresponding components are equal to each other: 

x = 2, 3 = x + y 
Substitute x = 2 into the second equation to obtain y = 1. Thus x = 2 and y = 1. 



1.4. Find x and y if (4, y) = x(2, 3). 

Multiply by the scalar x to obtain (4, y) = x(2, 3) = (2x, 3x). 

Set the corresponding components equal to each other: 4 = 2x, y = 3*. 

Solve the linear equations for x and y: x = 2 and y = 6. 



1.5. Find x, y and z if (2, -3, 4) = 1, 1) + y(\, 1, 0) + z(l, 0, 0). 

First multiply by the scalars x, y and z and then add: 

(2,-3,4) = x(l, 1, 1) + tf(l, 1, 0) + z(l, 0, 0) 

= (*, *, x) + (y, y, 0) + (z, 0, 0) 

= (x + y + z,x + y,x) 

Now set the corresponding components equal to each other: 

x + y + z = 2, x + y = —3, x = 4 

To solve the system of equations, substitute x = 4 into the second equation to obtain 4 + y = — 3 
or y = —7. Then substitute into the first equation to find z = 5. Thus x — 4, y = —7, z = 5. 

1.6. Prove Theorem 1.1: For any vectors u,v,wGK n and any scalars fc, fc'SR, 

(i) (m + v) + w = u + (v + w) (v) k(u + v) — ku + kv 

(ii) « + 0 = u (vi) (fe + k')u — ku + 

(iii) i* + (— it) = 0 (vii) (kk')u - k(k'u) 

(iv) u + v = v + u (viii) lu = u 
Let w i; and w { be the ith components of u, v and w, respectively. 
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(i) By definition, u { + v { is the ith component of u + v and so (u t + v { ) + w t is the ith component 
of (u + v) + w. On the other hand, v t + w 4 is the ith component of v + w and so u { + (v { + w 4 ) 
is the ith component of it + (v + w). But u i( i> t and w { are real numbers for which the as- 
sociative law holds, that is, 

(m 4 + i>j) + Wj = u t + (v f + Wj) for i = 1, . . . , n 
Accordingly, (w + v) + w — u + (v + w) since their corresponding components are equal. 

(ii) Here, 0 = (0, 0, . . ., 0); hence 

u + 0 = (Ml it 2 , . . ., w„) + (0, 0, . . ., 0) 

= (itj + 0, w 2 + 0, ...,M n + 0) = (u u u 2 , . . . , u n ) = u 

(iii) Since —u = — l(u lt u 2 , ...,m„) = (— u lt — m 2 , m„), 

M + (— H) = (U 1( M 2 , . . . , U n ) + (-Mj, -M 2 , . . . , -U n ) 

= («!-«!, ««2- M 2. •■•.«.-%) = (0, 0, ...,0) = 0 

(iv) By definition, u { + t> 4 is the ith component of u + v, and v { + w t is the ith component of v + u. 
But Mj and i>; are real numbers for which the commutative law holds, that is, 

«i + *i = v i + u i> t = 1, . . . , n 
Hence w + v = f + u since their corresponding components are equal. 

(v) Since u { + v { is the ith component oi u + v, + w 4 ) is the ith component of fe(tt + v). Since 
ku t and fefj are the ith components of fcit and lev respectively, ku { + kvi is the ith component 
of ku+ kv. But k, u { and v { are real numbers; hence 

k(u { + Vj) = fcitj + fc^j, i = 1, . . . , n 
Thus k(u + v) = ku + kv, as corresponding components are equal. 

(vi) Observe that the first plus sign refers to the addition of the two scalars k and k' whereas the 
second plus sign refers to the vector addition of the two vectors few and k'u. 

By definition, (fc + k')u { is the ith component of the vector (fe + k')u. Since few, and k'u { 
are the ith components of ku and k'u respectively, fcitj + fc'wj is the ith component of ku + k'u. 
But k, k' and u t are real numbers; hence 

(fe + — kui + fe'Mj, t = 1, . . . , n 
Thus (fe + k')u = ku + k'u, as corresponding components are equal. 

(vii) Since fe'itj is the ith component of k'u, k(k'u t ) is the ith component of fc(fe'zt). But (fcfc'K is the 
ith component of (kk')u and, since fe, fe' and w 4 are real numbers, 

(fefc')ttj = k(k'Ui), i = 1, . . . , n 
Hence (fefe')w = fe(fe'w), as corresponding components are equal. 

(viii) 1 • u = l(u v u 2 , u n ) = (Imx, lu 2 , lu n ) = (u lt u z , . . . , u n ) = u. 

1.7. Show that Ou = 0 for any vector u, where clearly the first 0 is a scalar and the second 
0 a vector. 

Method 1: 0m = 0(%, w 2 , . . . , u n ) = {0u lt 0« 2 , . . . , 0w„) = (0,0 0) = 0 

Method 2: By Theorem 1.1, 0u = (0 + 0)w = Ou + 0w 

Adding -Ou to both sides gives us the required result. 

DOT PRODUCT 

1.8. Compute u • v where: (i) u = (2, -3, 6), v = (8, 2, -3); (ii) u = (1, -8, 0, 5), v = (3, 6, 4); 
(iii) « = (3,-5,2,l), (4, 1,-2, 5). 

(i) Multiply corresponding components and add: u • v = 2*8+ (—3) • 2 + 6 • (—3) = —8. 

(ii) The dot product is not defined between vectors with different numbers of components. 

(iii) Multiply corresponding components and add: u • v = 3 • 4 + (—5) • 1 + 2 • (—2) + 1*5 = 8. 
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1.9. Determine fc so that the vectors u and v are orthogonal where 

(i) u = (1, k, -3) and v = (2, -5, 4) 

(ii) u = (2, 3fc, -4, 1, 5) and v = (6, -1, 3, 7, 2fc) 

In each case, compute u • v, set it equal to 0, and solve for fc. 

(i) wv = 1 • 2 + k • (-5) + (-3) ■ 4 = 2 - 5fc - 12 = 0, -5fc - 10 = 0, fc = -2 

(ii) »•« = 2*6 + 3*«(-l) + (-4)«3 + 1*7 + 5«2fc 

= 12 - 3k - 12 + 7 + 10fe = 0, k = -1 

1.10. Prove Theorem 1.2: For any vectors tt,v,wGR" and any scalar fc G B, 

(i) (w + • w — u • w + v ' w (iii) wv = vu 

(ii) (ku)-v = k(wv) (iv) - 0, and wu =0 iff u = 0 

Let tt = (tl„ M 2 u n)> v = v 2» • ■ ■ » ^n). w - («lf w 2> • • • » w n)- 

(i) Since tt + t; = (it t + u„ u 2 + v 2 , . . ., u n + v n ), 

(U + V)'W = (tlx + «!)»! + (M2 + V 2) w 2 + ' ' " + ( M n + V n )w n 

= u 1 w l + v 1 w 1 + u 2 w 2 + v 2 w 2 + • • ■ + u n w n + v n w n 

= («!««! + u 2 w 2 H + u n w n ) + (v 1 w l + v 2 w 2 + • • • + v n w n ) 

= U ' w + v • w 

(ii) Since ku = (ku lt ku 2 , . .., ku n ), 

(few) • v = ku-fVi + ku 2 v 2 + • • • + ku n v n - k(u 1 v 1 + u 2 v 2 + •■■ + u n v n ) - k(u • v) 

(iii) u - v = u 1 v l + u 2 v 2 + ■ • • + u n v n - v t u t + + • • • + v n u n = v • u 



(iv) Since wf is nonnegative for each i, and since the sum of nonnegative real numbers is non- 
negative ' wu = «? + «!+ ••• +u* * 0 

Furthermore, w • u — 0 iff w { — 0 for each i, that is, iff u = 0. 



DISTANCE AND NORM IN R" 

1.11. Find the distance d{u, v) between the vectors u and v where: (i) u = (1, 7), v = (6, -5); 
(ii) «=(3,-5,4), v = (6,2,-1); (iii) u = (5, 3, -2, -4, -1), v = (2,-1,0,-7,2). 



In each case use the formula d(u, v) = V («i _ v i) z + * + ( M n ~~ '"n) 2 • 



(i) d^) = V(l - 6) 2 + (7 + 5) 2 = V25 + 144 = V169 = 13 

(ii) d(w,v) = V(3 - 6)2 + (-5 - 2)2 + (4 + 1)2 = V9 + 49 + 25 = a/83 

(iii) d(u,t>) = V(5 - 2)2 + (3 + 1)2 + (-2 + 0)2 + (-4 + 7)2 + (-1 - 2)2 = V47" 

1.12. Find k such that d(u, v) = 6 where u = (2, fc, 1, -4) and v = (3, -1, 6, -3). 

(d(u,v))* = (2-3)2 + (fe + 1)2+ (1-6)2 + (-4 + 3)2 = fc2 + 2fe + 28 
Now solve fc 2 + 2fc + 28 = 6 2 to obtain k = 2, -4. 

1.13. Find the norm ||u|| of the vector u if (i) u = (2, -7), (ii) u = (3, -12, -4). 

In each case use the formula ||m|| = V"? + u| + • • • + «| . 

(i) |H| = \/22 + (-7)2 = a/4 + 49 = a/53 

(ii) || u || = a/3 2 + (-12)2 + (_ 4 )2 = V9 + 144 + 16 = \/l69 = 13 
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1.14. Determine & such that = \/39 where u = (1, k, -2, 5). 

|| M || 2 = l 2 + k 2 + (— 2) 2 + B 2 = A; 2 + 30 
Now solve fc 2 + 30 = 39 and obtain k = 3, -3. 

1.15. Show that ||u|| ^ 0, and ||u|| = 0 iff u = 0. 

By Theorem 1.2, tt-w^O, and wu = 0 iff u = 0. Since = yjwu, the result follows. 

1.16. Prove Theorem 1.8 (Cauchy-Schwarz): 

For any vectors u = (ui, . . . , u„) and v = (vi, . . . , v„) in B", \wv\^ \\u\\ \\v\\. 

n 

We shall prove the following stronger statement: \wv\ — ^£ k v i| — IHI IHI- 

If it = 0 or v = 0, then the inequality reduces to 0—0 — 0 and is therefore true. Hence we 
need only consider the case in which u 0 and v # 0, i.e. where |]u|| # 0 and ¥• 0. 

Furthermore, 

\U'V\ = |«!t»i + • • • + tt„t> B | - + + KvJ = SkVjl 

Thus we need only prove the second inequality. 

Now for any real numbers x, y G R, 0 — (x — y) 2 = x 2 — 2xy + y 2 or, equivalently, 

2xy ^ x 2 + j/ 2 (J) 
Set « = |Mj|/||M|| and y = H|/|H| in (1) to obtain, for any i, 

„ kl N _ kl 2 , HI 2 



IHI IMI IMP IHI 2 



(2) 



But, by definition of the norm of a vector, ||w|| = 2«f = 2 kl 2 and \\v\\ = 2*>? = 2k| 2 - Thus 
summing (2) with respect to i and using = |w,| we have 

,2W ^ 2kl 2 2 hi 2 W W 

IMI IHI IHI 2 IHI 2 IHI 2 IHI 2 
2 k^i! 

that is, ,, I, ■■ ,, - 1 

IHI IHI 

Multiplying both sides by ||tt|| we obtain the required inequality. 

1.17. Prove Minkowski's inequality: 

For any vectors u - (ui, . . .,%„) and v = (vi, . . .,v n ) in R", ||« + v\\ — \\u\\ + \\v\\. 

If + v\\ = 0, the inequality clearly holds. Thus we need only consider the case ||m + v|| # 0. 
Now + Vj| — + for any real numbers u it v t 6 R. Hence 

|| M + *H 2 = 2k+f<) 2 = 2k+* 1 | a 

= 2 K + »d k + «il - 2 k + «d (kl + HI) 
= 2 k+«d kl + 2 k+«tl kl 

But by the Cauchy-Schwarz inequality (see preceding problem), 

2k+»ilkl - ll«+«IIIMI and Sk+«,|N * ll* + HIIHI 

Thus \\u + v\\ 2 - \\u + v\\\\u\\ + \\u + v\\\\v\\ = \\u + v\\(\\u\\ + \\v\\) 

Dividing by ||w + v||, we obtain the required inequality. 
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1.18. Prove that the norm in R" satisfies the following laws: 
[Ni]: For any vector u, ||m||^0; and = 0 iff u — 0. 
[N 2 ]: For any vector u and any scalar k, \\ku\\ = \k\ \\u\\. 
[N 3 ]: For any vectors u and v, \\u + v\\ ^ \\u\\ + \\v\\. 

[N x ] was proved in Problem 1.15, and [N 3 ] in Problem 1.17. Hence we need only prove that 
[N 2 ] holds. 

Suppose u = (u lt I*?, . . . , u n ) and so fat = (fatj, ku^, fat„). Then 

[|fat|| 2 = (fatO 2 + (fat 2 ) 2 + ••• + (fat„) 2 = khL\ + khl\ + •■■ + Wu\ 
= & 2 (ttf + t4+-"+M 2 ) = fc 2 ||lt|| 2 

The square root of both sides of the equality gives us the required result. 
COMPLEX NUMBERS 

1.19. Simplify: (i) (5 + 3i)(2-7i); (ii) (4-3i) 2 ; (iii) —^T' < iv ) ItS ; (v) *' 
(vi) (l + 2i) 3 ; (vii) (g^si) 2 - 

(i) (5 + 3i)(2 - 7t) = 10 + 6i - 35i - 21P = 31 - 29t 

(ii) (4-3i) 2 = 16-24t + 9i 2 = 7-24i 

..... 1 _ (3 + 4i) _ 3 + 4i JL + ±i 
(m > 3-4i (3-4i)(3 + 4i) 25 25 25 

2 - li (2 - 7t)(5 - Si) _ -11 - 41i _ _H_41. 

(1V) 5 + 3t (5 + 3i)(5 - 3i) 34 34 34 

(v) t 8 = = (-l)i = -t; i* = i 2 '* 2 = 1; i 31 = (i 4 ) 7 '* 3 = 1 7> M) = -t 

(vi) (1 + 2i) 3 = 1 + 6i + 121 2 + 8*3 = 1 + 6i - 12 - 8i = -11 - 2t 

. ... / 1 V 1_ _ (-5 + 12t) _ -5 + 12t = B_ 12 . 

(vn) [^2 -Si J ~ -5-12* ~" (-6 - 12i)(-5 + 12i) 169 169 169 

1.20. Let z-2-Si and w = 4 + 5i. Find: 

(i) z + w and zw; (ii) z/w; (iii) z and w; (iv) |z| and \w\. 

(i) « + w = 2 - 3i + 4 + 5i = 6 + U 

zw = (2 - 3i)(4 + 5i) = 8 - 12i + lOi - ISi 2 = 23 - 2i 

r-\ L - 2 ~ 3i - (2 ~ 3*)(4 ~ _ -7 - 22i _ _ _7 _ 22 . 
w ~ 4 + hi ~ (4 + 5i)(4 - 5i) ~ 41 41 41 ' 

(iii) Use a+bi = a-bi: z = 2 -Si = 2 + Si; w = 4 + 5i = 4 - 5i. 

(iv) Use |o+ W| = Va 2 + b 2 : 1*1 = |2-3i| = V4 + 9 = ^13; M = |4 + 5t| = V16 + 26 = V^l- 



1.21. Prove: For any complex numbers z, w e C, 

(i) z + w = z + w, (ii) zw = zw, (iii) z = z. 

Suppose z = a+bi and ic = e + di where a, b, c, d G R. 

(i) z~+w = (a+bi) + (e + di) = {a+e) + (b + d)i 

= (a + e) — (b + d)i = a + c — bi — di 
= (a-bi) + (e-di) = z + w 

(ii) zw = (a + bi)(o + di) = (ae - bd) + (ad + be)i 

= (etc — bd) — (ad + be)i = (a - bi)(e — di) — zw 

(iii) 1 — a+bi — a — bi = a — (~b)i = a + bi = z 
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1.22. Prove: For any complex numbers z,wGC, \zw\ = \z\ \w\. 

Suppose z = a + bi and w = c + di where a, b,c,dG R. Then 

|z[ 2 = a 2 + 6 2 , |w| 2 = + and zw = (ac - bd) + (ad + bc)i 

Thus |z«>| 2 = (ac-6d)2+ (ad+fcc) 2 

= a 2 c 2 - 2abcd + b*cP + a 2 d 2 + 2abcd + Wc 2 

= a 2 (c 2 + d2) + o^ + d 2 ) = (a 2 + 6 2 )(c 2 + d 2 ) = lz| 2 |w| 2 

The square root of both sides gives us the desired result. 

1.23. Prove: For any complex numbers z,wGC, \z + w\ — \z\ + \w\. 

Suppose z = a + bi and w = c + di where a,b,e,dG. R. Consider the vectors u = (a, b) and 
v = (c, d) in R 2 . Note that 

\z\ = Va 2 + & 2 = ||u||, |w| = Vc 2 + d 2 = ||t>|| 
and \z + w\ = \(a + e) + (b + d)i\ = v> + c) 2 + (6 + d) 2 = ||(o + c, 6 + d)\\ = \\u + v\\ 
By Minkowski's inequality (Problem 1.17), Hm + vH ^ ||u|| + \\v\\ and so 

\z + w\ = \\u + v\\ * INI + INI = \z\ + \w\ 

VECTORS IN C- 

1.24. Let u = (3 - 2t, 4i, 1 + 6i) and v = (5 + i, 2 - 3i, 5). Find: 
(i) m + v, (ii) 4tu, (iii) (1 + (iv) (1 - 2i)u + (3 + 

(i) Add corresponding components: u + v = (8 — i, 2 + i, 6 + 6i). 

(ii) Multiply each component of u by the scalar 4i: 4iu = (8 + 12i, —16, —24 + 4i). 

(iii) Multiply each component of v by the scalar 1 + i: 

(l + t> = (5 + 61 + i 2 , 2-i-3i2, 5 + 5i) = (4 + 6?, 5 - i, 5 + 6i) 

(iv) First perform the scalar multiplication and then the vector addition: 

(1 - 2i)u + (3 + i)v = (-1 - 8i, 8 + 4i, 13 + 4i) + (14 + 8i, 9 - 7i, 15 + 5i) 
= (13, 17 -3i, 28 + 9t) 

1.25. Find and vu where: (i) u = (l-2i, 3 + i), r = (4 + 2i, 5-6i); (ii) u = 
(3-2i,4i, l + 6i), v = (5 + i, 2-3i, 7 + 2i). 

Recall that the conjugates of the second vector appear in the dot product: 
(*i> -..,«„)• • • w„) = + • • ■ + z„w„ 

(i) u • v = (1 - 2t0(4+~2l) + (3 + i)(5^6?) 

= (1 - 2i)(4 - 2i) + (3 + i)(5 + 6t) = -10* + 9 + 23i = 9 + 13i 

vu - (4 + 2i)(l - 2i) + (5 - 6t)(3 + i) 

= (4 + 2i)(l + 2i) + (5-6i)(3-i) = lOi + 9 - 23i = 9 - 13i 

(ii) u • v = (3 - 2i)(5 + i) + (4i)(2 - 3i) + (1 + 6i)(7 + 2i) 

= (3 - 2i)(5 - i) + (4i)(2 + St) + (1 + 6t)(7 - 2i) = 20 + 35i 

• w. = (5 + i)(3 - 2i) + (2 - 3t)(4i) + (7 + 2t)<T+lH) 

= (5 + i)(3 + 2i) + (2 - 3i)(-4t) + (7 + 2t)(l - 6i) = 20 - 35i 

In both examples, vu — w v. This holds true in general, as seen in Problem 1.27. 
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1.26. Find \\u\\ where: (i) u = (3 + 4i, 5 - 2i, 1 - 3i); (ii) u = (4 - i, 2i, 3 + 2i, 1 - hi). 

Recall that zz — a 2 + b 2 when z = a + bi. Use 

||m|| 2 = wu = + + • • • + z n z n where z = (z u z 2 z n ) 

(i) ||m|P = (3)2 + (4)2 + (5)2 + (-2) 2 + (1)2 + (-3)2 = 64, or ||«|| = 8 

(ii) 1|m||2 = 42 + (-1)2 + 22 + 32 + 22 + 12 + (-5)2 = 60, or |H| = V60 = 2\/T5 

1-27. Prove: For any vectors u, v £ C n and any scalar z G C, (i) u • v = vu, (ii) (zu) • v = 
z(u • v), (iii) u • (zv) = z(u • v). (Compare with Theorem 1.2.) 

Suppose u = (z v z 2 , z n ) and v = (w,, w 2 , . . . , w„). 

(i) Using the properties of the conjugate established in Problem 1.21, 

VU - Wfa + W 2 Z 2 + • • • + «7„Z„ = Wp! + Wp 2 + • • • + «^l„ 

= Wfa + w 2 z 2 + • • • + w n z n = z 1 w 1 + z 2 w 2 + • • ■ + z n w n — 11* V 

(ii) Since zu = {zz v zz 2 , . . . , zz n ), 

(zu) • v — zz l w l + zz 2 w 2 + ■ ■ ■ + zz n w n = z(z 1 w 1 + z 2 w 2 + • • • + z n w n ) = z(u • v) 

(iii) Method 1. Since zv — (zw lt zw 2 , . . . , zw n ), 

u • (zv) = z 1 zw l + z 2 zw 2 + • • • + z n zw n — z 1 zw 1 + z 2 zw 2 + • • • + Z n ZW n 
= z(z 1 w l + z 2 w 2 + ■■• + z n w n ) = z(u • V) 

Method 2. Using (i) and (ii), 

u • (zv) = (zv) ' u = z(v • u) = 2 (v • u) = z (u • v) 

MISCELLANEOUS PROBLEMS 

1.28. Let u = (3, -2, 1, 4) and v = (7, 1, -3, 6). Find: 

(i) u + v; (ii) 4u; (iii) 2u - Sv; (iv) wv; (v) and (vi) d{u, v ). 

(i) u + v = (3 + 7,-2 + 1,1-3,4 + 6) = (10,-1,-2,10) 

(ii) 4u = (4 • 3, 4 • (-2), 4 • 1, 4 • 4) = (12, -8, 4, 16) 

(iii) 2u-3v = (6, -4, 2, 8) + (-21, -3, 9, -18) = (-15, -7, 11, -10) 

(iv) u • v = 21-2-3 + 24 = 40 

(v) ||u|| = V9 + 4 + 1 + 16 = V30, \M\ = V49 + 1 + 9 + 36 = v^5 

(vi) d(u,v) = V(3 - 7)2 + (-2 - 1)2 + (1 + 3)2 + (4 - 6)2 = V45 = 3^ 

1.29. Let u = (7 - 2i, 2 + 5i) and v = (1 + i, -3 - 6i). Find: 
(i) u + v; (ii) 2m; (iii) (3 - i)v; (iv) wv; (v) ||u|| and 

(i) u + v = (7-2i+l + i, 2 + 5i-3-6i) = (8 — i, — 1 — i) 

(ii) 2iu = (14i-4i2, 4t+10i2) = (4 + 14i, -10 + 4i) 

(iii) (3-i)v = (3 + Zi-i-P, -9 - 18i + 3t + 6?) = (4 + 2t, -15 - 15i) 

(iv) u* v — (7 - 2i)(TTt) + (2 + 5i)(-3 - 6t) 

= (7 - 2i)(l - 1) + (2 + 5i)(-3 + 6i) = 5 - 9i - 36 - 3i = -31 - 12i 

(v) = V7 2 + (-2)2 + 22 + 52 = V§2, IHI = VI 2 + I 2 + (-3) 2 + (-6)2 = V47 
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1.30. Any pair of points P = (cu) and Q = (h) in R n de- 
fines the directed line segment from P to Q, written 

PQ. We identify PQ with the vector v = Q-P: 
— > 

PQ = v = (bi-a u b 2 -a 2 , . . ., b n -dn) 

Find the vector v identified with PQ where: 

(i) P = (2,5), Q = (-3,4) 

(ii) P =(1,-2,4), Q = (6,0,-3) 

(i) v = Q-P = (-3-2,4-5) = (-5,-1) 

(ii) „ = Q-P = (6-1,0 + 2,-3-4) = (5,2,-7) 

1.31. The set H of elements in R" which are solutions of a linear equation in n unknowns 
x\, . . . , x n of the form 

CiXl + C 2 X 2 + • • • + CnX n = b (*) 

with u = (ci, . . . , c„) ¥= 0 in R", is called a kyperplane of R n , and (*) is called an equa- 
tion of (We frequently identify H with (*).) Show that the directed line segment 
PQ of any pair of points P,QgH is orthogonal to the coefficient vector u; the vector 
u is said to be normal to the hyperplane H. 

Suppose P = (a u . . . , a n ) and Q = (6 1( . . . , 6 n ). Then the a s and the b t are solutions of the 
given equation: 

+ c 2 a 2 + ■ ■ ■ + c n a n = b, c 1 b 1 + c 2 b 2 + • • • + c n b„ = b 
^ V = FQ = Q-P = (&!-«!, 63-02, .... 6„-o„) 

Then m • v = c^bj - Oj) + c 2 (6 2 - ©2) + • • • + e n (b n - a n ) 

= c l b 1 — Cittj + c 2 6 2 - «2 a 2 + • • • + c n b n - c n a n 

= (c^ + c 2 6 2 + • • • + c n b n ) - (Cjtt! + c 2 a 2 + • • • + c„a„) = 6-6 = 0 
Hence that is, PQ, is orthogonal to u. 

1.32. Find an equation of the hyperplane H in R 4 if: (i) H passes through P = (3, -2, 1, -4) 
and is normal to u = (2,5,-6,-2); (ii) H passes through P = (1,-2, 3, 5) and is 
parallel to the hyperplane H' determined by 4x - 5y + 2z + w = 11. 

(i) An equation of H is of the form 2x + 5y — 6z - 2w = k since it is normal to it. Substitute 
P into this equation to obtain k = —2. Thus an equation of H is 2x + 5y — 6z — 2w = —2. 

(ii) H and ii 7 are parallel iff corresponding normal vectors are in the same or opposite direction. 
Hence an equation of H is of the form 4x — 5y + 2z + w = k. Substituting P into this equa- 
tion, we find k = 25. Thus an equation of H is 4a; — 5y + 2z + w = 25. 

1.33. The line I in R" passing through the point P = (a 4 ) 
and in the direction of u = (ui) ¥- 0 consists of the 
points X — P + tu, t G R, that is, consists of the 
points X = (xi) obtained from 

Xi = Ci + Uit 

Xi — a 2 + 



x„ = a n + Unt 

where t takes on all real values. The variable t is 
called a parameter, and (*) is called a parametric rep- 
resentation of i. 
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(i) Find a parametric representation of the line passing through P and in the direc- 
tion of u where: (a)P = (2,5) and w = (-3,4); (b) P = (4, -2, 3, 1) and u = 
(2,5,-7,11). 

(ii) Find a parametric representation of the line passing through the points P and Q 
where: (a)P = (7,-2) and Q = (9,3); (6) P = (5, 4, -3) and Q = (l,-3,2). 



(i) In each case use the formula (*). 



(x = 2-3* 
ia) \y = 5 + 4* 



(6) 



x = 4 + 2t 

y = -2 + 5t 

z = 3 - 7t 

w = 1 + lit 



(In K 2 we usually eliminate t from the two equations and represent the line by a single 
equation: 4* + Zy = 23.) 



(ii) First compute u — PQ = Q — P. Then use the formula (*). 

(o) w = Q-P = (2,5) (6) it = Q-P = (-4,-7,5) 

a = 7 + 2* [-» = 5 - 4t 

V = -2 + 5t \ y = 4 - 7t 



{ 



z = -3 + 5f 

(Note that in each case we could also write u = QP = P — Q.) 



Supplementary Problems 

VECTORS IN R» 

1.34. Let it = (1,-2, 5), v = (3,1,-2). Find: (i) it + v; (ii) -6u; (iii) 2u - 5v; (iv) wv; (v) 
and (vi) d(u,v). 

1.35. Let u — (2, 1, 0, 3), d = (1,-1,-1,3), w = (1, 3, -2, 2). Find: (i) 2u - 3v; (ii) 5m - 3i> - 4w; 
(iii) — it + 2v — 2w; (iv) u • v, it • w and v • w; (v) d(M,») and d(r,w). 

1.36. Let it = (2,1,-3,0,4), « = (5, -3, -1, 2, 7). Find: (i) it + v; (ii) 3u - 2v; (iii) (iv) ||u|| 
and (v) d(u,v). 



1.37. Determine k so that the vectors it and v are orthogonal, (i) it = (3, k, — 2), v = (6, — 4, — 3). (ii) u~ 
(5,k, -4, 2), « = (1, -3, 2, 2fc). (iii) it = (1, 7, fe + 2, -2), « = (3, k, -3, fc). 

1.38. Determine x and j/ if: (i) (x,x + y) = (y-2,6); (ii) x(l, 2) = -4(y, 3). 

1.39. Determine a; and y if: (i) *(3,2) = 2(j/,-l); (ii) x(2, y) = y(\, -2). 

1.40. Determine a;, j/ and z if: 

(i) (3,-1,2) - X (L, 1, 1) + j/d, -1, 0) + *(1, 0, 0) 

(ii) (-1,3,3) = ss(l, 1, 0) + y(0, 0, -1) + z(0, 1, 1) 



1.41. 



Let e t = (1,0,0), e 2 = (0, 1, 0), e s = (0,0,1). Show that for any vector u = (a, b,c) in R 3 : 
(i) m = ae 1 + be 2 + ce 3 ; (ii) it • e x = a, it • e 2 = 6, w • e 3 = c. 
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1.42. Generalize the result in the preceding problem as follows. Let e i G R n be the vector with 1 in the 
ith coordinate and 0 elsewhere: 

e, = (1,0,0, ...,0,0), e 2 = (0,1,0, .. .,0,0), e„ = (0, 0,0, . . .,0, 1) 

Show that for any vector u — (a x ,a%, — ,«„), 

(i) u = a 1 e 1 + a 2 e 2 + ••• + a n e n , (ii) it • e t .= a t for i = 1, . . .,n 

1.43. Suppose u G R" has the property that m • v = 0 for every u e R™. Show that u = 0. 

1.44. Using d(tt,i>) = ||m-d|| and the norm properties [WJ, [W 2 ] and [JV 3 ] in Problem 1.18, show that 
the distance function satisfies the following properties for any vectors u,v,wG R»: 

(i) d(u,v)^0, and d(u, v) = 0 iff u = v; (ii) d(u,v) = d(v,u); (iii) d(u, w) ^ d(u, v) + d{v, w). 
COMPLEX NUMBERS 

1.45. Simplify: (i) (4 - 7i)(9 + 2i); (ii) <3-5i)2; (iii) ; ( v > d - *") 8 - 

1.46. Simplify: (i) ^; (ii) y^ff; (iii) i 15 , t^ 5 , (iv) (g^)*- 

1.47. Let z = 2-5i and w = 7 + 3i. Find: (i) z + to; (ii) zw; (iii) z/w; (iv) z, w; (v) |*|, |w|. 

1.48. Let z = 2 + t and w = 6 - 5i. Find: (i) z/w; (ii) z, w; (iii) |z|, |w| . 

1.49. Show that: (i) zz~i = 1; (ii) |z| = |z|; (iii) real part of z = £(z + z); (iv) imaginary part of 

* = (z - 8)/2i 

1.50. Show that zw = 0 implies z = 0 or w = 0. 
VECTORS IN C" 

1.51. Let u = (1 + 7i, 2 - 6i) and v = (5 - 2i, 3 - 4i). Find: (i) m + r; (ii) (3 + t>; (iii) 2iu + (4 - H)v; 
(iv) wvand-ww; (v) ||w|| and 

1.52. Let m = (3 - 7i, 2i, -1 + i) and v = (4 - i, 11 + 2i, 8 - 3t). Find: (i) it - i>; (ii) (3 + t>; (iii) 
u*vandvu; (iv) ||w|| and . 

1.53. Prove: For any vectors u,v,w G C n : 

(i) (m + v) • w = u • w + v • w, (ii) w (u + v) = wu + wv. (Compare with Theorem 1.2.) 

1.54. Prove that the norm in C n satisfies the following laws: 
[iVj]: For any vector u, \\u\\ ^ 0; and ||w|| =0 iff u = 0. 

[N 2 ]- For any vector u and any complex number z, ||zw|| = |z| ||w||. 
[2V S ]: For any vectors it and v, ||u + v\\ - \\u\\ + \\v\\. 
(Compare with Problem 1.18.) 

MISCELLANEOUS PROBLEMS 

1.55. Find an equation of the hyperplane in R 3 which: 

(i) passes through (2, —7, 1) and is normal to (3, 1, —11); 

(ii) contains (1, -2, 2), (0, 1, 3) and (0, 2, -1); 

(iii) contains (1, -5, 2) and is parallel to 3a; - 1y + 4z = 5. 

1.56. Determine the value of k such that 2x - ky + 4z - 5w = 11 is perpendicular to 1x + 2y - z + 
2w = 8. (Two hyperplanes are perpendicular iff corresponding normal vectors are orthogonal.) 
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1.57. Find a parametric representation of the line which: 

(i) passes through (7, —1, 8) in the direction of (1, 3, —5) 

(ii) passes through (1,9,-4,5) and (2,-3,0,4) 

(iii) passes through (4,-1,9) and is perpendicular to the plane 3« — 2y + z = 18 

1.58. Let P, Q and R be the points on the line determined by 

*1 = °1 + M l*f a; 2 = «2 + tt 2 t, • • • > x n — a n + M n* 

which correspond respectively to the values t lf t 2 and t s for t. Show that if t t < t z < t 3 , then 
d(P,Q) + d(Q,R) = d(P,R). 

Answers to Supplementary Problems 

1.34. (i) u + v = (4, -1, 3); (ii) -6u = (-6, 12, -30); (iii) 2it - 5v = (-13, -9, 20); (iv) m • v = -9; 
(v) ||tt|| = V30, \\v\\ = Vl4; (vi) d(u,v) = V&2 

1.35. (i) 2u-3v = (1, 1, 3, -15); (ii) 5u - 3« - 4w = (3, -14, 11, -32); (iii) -w + 2t> - 2w = (-2,-7, 2, 5); 

(iv) m • v = —6, tt ■ w = —7, d • w = 6; (v) d(t«, ») — V38 , d(v, w) = 3^2 

1.36. (i) u + v = (7, -2, -4, 2, 11); (ii) 3m - 2v = (-4, 9, -7, -4, -2); (iii) u • » = 38; (iv) = V30 , 
|H| =2^/22; (v) d(u,«) = V4l 

1.37. (i) A; = 6; (ii) fc = 3; (iii) k = 3/2 

1.38. (i) x = 2, y — 4; (ii) a; = -6, « = 3/2 

1.39. (i) x = —1, y = -3/2; (ii) x = 0, j/ = 0; or * = -2, y = -4 

1.40. (i) x = 2, j/ = 3, z = -2; (ii) x = -1, » = 1, z = 4 
1.43. We have that w • it = 0 which implies that u = 0. 

1.45. (i) 50 - 55i; (ii) -16 - 30i; (iii) (4 + 7i)/65; (iv) (1 + 3i)/2; (v) -2 - 2t. 

1.46. (i) (ii) (5 + 27i)/58; (iii) -i, i, -1; (iv) (4 + 3i)/50. 

1.47. (i) z + «; = 9 - 2i; (ii) zw = 29 - 29i; (iii) z/w = (-1 - 41i)/58; (iv) z = 2 + 5i, w = 7 - 3i; 

(v) \z\ = V29, M = V58. 

1.48. (i) z/w = (7 + 16t)/61; (ii) z = 2 - i, w = 6 + 5t; (iii) |z| = \/5 , |w| = \/61 . 

1^0. If zw = 0, then \zw\ = |z| |w| = |0| = 0. Hence \z\ = 0 or |w| = 0; and so z = 0 or w = 0. 

1.51. (i) u + v = (6 + 5i, 5 - lOi) (iv) m • r = 21 + 27i, v • u = 21-21% 

(ii) (3 + i)u = (-4 + 22i, 12 - 16t) (v) ||w|| = 3\/l0 , ||v|| = 3^6 

(iii) 2iu + (4 - 7i)v = (-8 - 41i, -4 - 33i) 

1.52. (i) u - v = (-1 - Si, -11, -9 + 4i) (iii) wv - 12 + 2i, vu = 12 - 2i 
(ii) (3 + 1> = (13 + i, 31 + 17t, 27 - i) (iv) |H| = 8, \\v\\ = \/215 

1.55. (i) 3x + y-llz = -12; (ii) 13« + 4y + z = 7; (iii) 3* - 7j/ + 4« = 46. 

1.56. k = 0 



1.57. (i) 



= 7 + t 


(ii) 


x = 


1 + 1 


= -1 + 3t 




» = 


9 - 12* 


= 8-5* 


- 


z = 


-4 + 4t 






w = 


5 - t 



(iii) 




Chapter 2 



Linear Equations 

INTRODUCTION 

The theory of linear equations plays an important and motivating role in the subject 
of linear algebra. In fact, many problems in linear algebra are equivalent to studying a 
system of linear equations, e.g. finding the kernel of a linear mapping and characterizing 
the subspace spanned by a set of vectors. Thus the techniques introduced in this chapter 
will be applicable to the more abstract treatment given later. On the other hand, some of 
the results of the abstract treatment will give us new insights into the structure of "con- 
crete" systems of linear equations. 

For simplicity, we assume that all equations in this chapter are over the real field R. We 
emphasize that the results and techniques also hold for equations over the complex field C 
or over any arbitrary field K. 



LINEAR EQUATION 

By a linear equation over the real field R, we mean an expression of the form 

aiZi + a%Xi + ■ ■ ■ + a n x n = b (1) 

where the o 4 , b € R and the x t are indeterminants (or: unknowns or variables). The scalars 
di are called the coefficients of the Xi respectively, and b is called the constant term or simply 
constant of the equation. A set of values for the unknowns, say 

Xl = ki, Xz = k 2 , . . . , Xn = kn 

is a solution of (1) if the statement obtained by substituting Id for x it 

aiki + O2&2 + • • • + a n k n = b 
is true. This set of values is then said to satisfy the equation. If there is no ambiguity 
about the position of the unknowns in the equation, then we denote this solution by simply 

then-tuple ... , . 

u = (ki, ki, . . . , kn) 

Example 2.1 : Consider the equation x + 2y — 4z + w = 3. 

The 4-tuple u = (3, 2, 1, 0) is a solution of the equation since 
3 + 2'2-4-l + 0 = 3 or 3 = 3 
is a true statement. However, the 4-tuple v = (1, 2, 4, 5) is not a solution of the 
equation since i + 2 . 2 _ 4 • 4 + 5 = 3 or -6 = 3 
is not a true statement. 

Solutions of the equation (1) can be easily described and obtained. There are three 
cases: 

Case (i) : One of the coefficients in (1) is not zero, say ai ¥> 0. Then we can rewrite the 
equation as follows 

a x xi = b - a 2 x 2 - • • • - OnX n or Xi = a^ l b - of 1 02*2 - • • • - of x a n x n 
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By arbitrarily assigning values to the unknowns x 2 , ...,x n> we obtain a value for xv, these 
values form a solution of the equation. Furthermore, every solution of the equation can 
be obtained in this way. Note in particular that the linear equation in one unknown, 

ax = b, with o^O 

has the unique solution x — a~ x b. 

Example 2.2: Consider the equation 2x — Ay + z — 8. 

We rewrite the equation as 

2b = 8 + Ay — z or x = 4 + 2y — \z 

Any value for y and z will yield a value for x, and the three values will be a solution 
of the equation. For example, let y = 3 and z = 2; then x = 4 + 2- 3 — ^-2 = 9. 
In other words, the 3-tuple u = (9, 3, 2) is a solution of the equation. 

Case (ii): All the coefficients in (1) are zero, but the constant is not zero. That is, the 
equation is of the form 

Occi + 0*2 + • • • + 0x„ = b, with b # 0 

Then the equation has no solution. 

Case (iii): All the coefficients in (1) are zero, and the constant is also zero. That is, 
the equation is of the form 

0*! + 0a; 2 + • • • + 0*„ = 0 
Then every w-tuple of scalars in R is a solution of the equation. 

SYSTEM OF LINEAR EQUATIONS 

We now consider a system of m linear equations in the n unknowns xi, . .., x n : 

dllXl + (112*2 + • • ■ + ainXn = i>l 

(121*2 + d22*2 + • * • + a 2 nX n = & 2 , . 



OmlXl + Om2*2 + • - • + OmnXn = 6m 

where the a«, bi belong to the real field R. The system is said to be homogeneous if the con- 
stants 61, . . . , b m are all 0. An rc-tuple u = (ki, . . . , k n ) of real numbers is a solution (or: 
a particular solution) if it satisfies each of the equations; the set of all such solutions is 
termed the solution set or the general solution. 

The system of linear equations 

dllXi + di2*2 + • • ■ + ttinXn — 0 

^21*1 + (122*2 + • • • + a2n*n =0 , . 

(*•) 

a m lXi + Om 2 *2 + • • • + OmnXn = 0 

is called the homogeneous system associated with (*). The above system always has a solu- 
tion, namely the zero ?z-tuple 0 = (0, 0, . . . , 0) called the zero or trivial solution. Any 
other solution, if it exists, is called a nonzero or nontrivial solution. 

The fundamental relationship between the systems (*) and (**) follows. 
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Theorem 2.1: Suppose u is a particular solution of the nonhomogeneous system (*) and 
suppose W is the general solution of the associated homogeneous system (**). 

^ ejl u + = {u + w: w G W} 

is the general solution of the nonhomogeneous system (*). 

We emphasize that the above theorem is of theoretical interest and does not help us to 
obtain explicit solutions of the system (*). This is done by the usual method of elimination 
described in the next section. 

SOLUTION OF A SYSTEM OF LINEAR EQUATIONS 

Consider the above system (*) of linear equations. We reduce it to a simpler system as 
follows: 

Step 1. Interchange equations so that the first unknown xi has a nonzero coeffi- 
cient in the first equation, that is, so that an ¥-0. 

Step 2. For each i > 1, apply the operation 

Li -* —auLi + anLi 

That is, replace the tth linear equation U by the equation obtained by mul- 
tiplying the first equation Li by —an, multiplying the ith equation L ( by 
an, and then adding. 

We then obtain the following system which (Problem 2.13) is equivalent to (*), i.e. has 
the same solution set as (*): 

auXi + a'\<2$i + a'13%3 + • • • + ai n x n = b[ 



au 2 Xj 2 



'32 '2 

where an ¥= 0. Here Xj 2 denotes the first unknown with a nonzero coefficient in an equation 
other than the first; by Step 2, x h ¥ xi. This process which eliminates an unknown from 
succeeding equations is known as (Gauss) elimination. 

Example 2.3: Consider the following system of linear equations: 

2* + Ay - z + 2v + 2w = 1 

3x + 6y + z — v + 4w = —7 

4x + &y + z + 5v — w = 3 

We eliminate the unknown * from the second and third equations by applying the 
following operations: 

L 2 -» -3L t + 2L 2 and L 3 -> -2Lj + L 3 

We compute —ZL^. -6se - VZy + 3z - 6v - Gw = -3 

2Z, 2 : 6x + 12y + 22 - 2v + Sw - -14 

-3L X + 2L 2 : hz-iv + 2w - -17 

and — 2L t : — 4x - 8y + 2z - iv - 4w - -2 

L 3 : 4* + 8y + z + 5v - w = 3 

-2Lj + L 3 : 3« + v - 5w = 1 



a'mu%io + + OmnXn — bm 
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Thus the original system has been reduced to the following equivalent system: 

2x + 4j/ — z + 2v + 2w = 1 

5z - 8v + 2w = -17 
3z + v — 5w = 1 

Observe that y has also been eliminated from the second and third equations. Here 
the unknown z plays the role of the unknown above. 

We note that the above equations, excluding the first, form a subsystem which has 
fewer equations and fewer unknowns than the original system (*). We also note that: 

(i) if an equation Osci + • • ■ + 0x n = b, b ¥> 0 occurs, then the system is incon- 
sistent and has no solution; 

(ii) if an equation Oxi + ■ • • + 0x n = 0 occurs, then the equation can be deleted 
without affecting the solution. 

Continuing the above process with each new "smaller" subsystem, we obtain by induction 
that the system (*) is either inconsistent or is reducible to an equivalent system in the 
following form 

O-llXi + 0,12X2 + (I13X3 + + CLlnXn = &1 

a>2i.Xj„ + a 2 ,i,+ iXu+i + + a.2nX n = b% 

(***) 



Grj T Xj r + a r ,j r +lXj r +l + * * • + UmXn — &r 

where 1 < 32 < • • • < jr and where the leading coefficients are not zero: 

an * 0, a 2j - 2 ^0, . . . , a rjr ¥=0 

(For notational convenience we use the same symbols an, b k in the system (***) as we used 
in the system (*), but clearly they may denote different scalars.) 

Definition: The above system (***) is said to be in echelon form; the unknowns x t which 
do not appear at the beginning of any equation (i 9* 1, j'2, . . . , jr) are termed 
free variables. 

The following theorem applies. 

Theorem 2.2: The solution of the system (♦**) in echelon form is as follows. There are 
two cases: 

(i) r — n. That is, there are as many equations as unknowns. Then the 
system has a unique solution. 

(ii) r <n. That is, there are fewer equations than unknowns. Then we 
can arbitrarily assign values to the n - r free variables and obtain a 
solution of the system. 

Note in particular that the above theorem implies that the system (***) and any equiv- 
alent systems are consistent. Thus if the system (*) is consistent and reduces to case (ii) 
above, then we can assign many different values to the free variables and so obtain many 
solutions of the system. The following diagram illustrates this situation. 
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System of linear equations 



Inconsistent 



Consistent 



No 
solution 



Unique 
solution 



More than 
one solution 



In view of Theorem 2.1, the unique solution above can only occur when the associated 
homogeneous system has only the zero solution. 



Example 2.4: We reduce the following system by applying the operations L 2 -* —31,! + 2L 2 and 
L 8 -» —SL 1 + 2L 3 , and then the operation L 3 -» — 3L 2 + W- 

2x + y — 2z + Zw = 1 2x + y - 2z + Zw = 1 2x + y - 2z + Zw = 1 
3x + 2y — z + 2w = 4 y + 4z — 5w = 5 y + 4z — 5w = 5 

3x + 3j/ + 3z - 3w = 5 3j/ + 12z - 15w = 7 0 = -8 

The equation 0 = -8, that is, 0x + 0y + 0z + 0w = -S, shows that the original 
system is inconsistent, and so has no solution. 



Example 2.5: We reduce the following system by applying the operations L 2 -» —L t + L 2 , 
L 3 -> — 2L X + L z and L 4 -♦ —21/! + L 4 , and then the operations L 3 -* L 2 — L 3 
and L 4 -> — 2L 2 + L 4 : 



a; + 2j/ - Zz - 


4 


x + 2y - 3z = 


4 


x + 2y - Sz 


= 4 


x + 3j/ + z = 


11 


y + 4z = 


7 


y + 4z 


= 7 


2* + 5y — 4z = 


13 


2/ + 2z = 


5 


2z 


= 2 


2x + 6y + 2z = 


22 


2j/ + 8z = 


14 


0 


= 0 



x + 2y - 3z = 4 
y + 4z = 7 
2z = 2 



Observe first that the system is consistent since there is no equation of the form 
0 = 6, with b ¥= 0. Furthermore, since in echelon form there are three equations 
in the three unknowns, the system has a unique solution. By the third equation, 
z = 1. Substituting z = 1 into the second equation, we obtain y = 3. Substitut- 
ing z = 1 and y = 3 into the first equation, we find x = 1. Thus x = 1, y = 3 
and z = 1 or, in other words, the 3-tuple (1, 3, 1), is the unique solution of the 
system. 



Example 2.6: We reduce the 4 ollowing system by applying the operations L 2 -» -2L X + L 2 and 
L 3 -» —5L t + L 3 , and then the operation L s -» — 2L 2 + L 3 : 

a; + 2j/ - 2z + 3w = 2 x + 2y - 2z + Zw = 2 x + 2y - 2z + Zw = 2 
2x + 4y - Zz + Aw = 5 z - 2w = 1 z - 2w = 1 

5x + 10j/ - 8z + llw = 12 2z - 4w = 2 0 = 0 



x + 2y - 2z + Zw = 2 
z - 2w = 1 
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The system is consistent, and since there are more unknowns than equations in 
echelon form, the system has an infinite number of solutions. In fact, there are 
two free variables, y and w, and so a particular solution can be obtained by giving 
y and w any values. For example, let w = 1 and y = —2. Substituting w = 1 
into the second equation, we obtain z = 3. Putting w = 1, z = 3 and y = — 2 
into the first equation, we find x — 9. Thus a; = 9, = -2, z = 3 and w = 1 or, 
in other words, the 4-tuple (9, -2, 3, 1) is a particular solution of the system. 

Remark: We find the general solution of the system in the above example as follows. 

Let the free variables be assigned arbitrary values; say, y = a and w = 6. 
Substituting w - b into the second equation, we obtain z-l+2b. Putting 
y = a, z = 1 + 26 and w = 6 into the first equation, we find x = 4 - 2a + b. 
Thus the general solution of the system is 

x = 4 - 2a + 6, y = a, z = 1 +2b, w — b 

or, in other words, (4 - 2a + 6, a, 1 + 26, 6), where a and 6 are arbitrary num- 
bers. Frequently, the general solution is left in terms of the free variables y 
and w (instead of a and 6) as follows: 

x = 4 - 2y + w, z = 1 + 2w or (4 - 2y + w, y, 1 + 2w, w) 

We will investigate further the representation of the general solution of a 
system of linear equations in a later chapter. 

Example 2.7: Consider two equations in two unknowns: 

a t x + &!# = c t 

a 2 x + b 2 y = c 2 

According to our theory, exactly one of the following three cases must occur: 

(i) The system is inconsistent. 

(ii) The system is equivalent to two equations in echelon form. 

(iii) The system is equivalent to one equation in echelon form. 

When linear equations in two unknowns with real coefficients can be represented 
as lines in the plane R 2 , the above cases can be interpreted geometrically as follows: 

(i) The two lines are parallel. 

(ii) The two lines intersect in a unique point. 

(iii) The two lines are coincident. 

SOLUTION OF A HOMOGENEOUS SYSTEM OF LINEAR EQUATIONS 

If we begin with a homogeneous system of linear equations, then the system is clearly 
consistent since, for example, it has the zero solution 0 = (0, 0, . . ., 0). Thus it can always 
be reduced to an equivalent homogeneous system in echelon form: 

anXi + (I12X2 + CI13X3 + + amXn = 0 

a2j 2 Xj 2 + a 2 ,j 2 +iXj 2 +i + + a 2n x„ = 0 



a r jXj r + a r ,i r +iXj r +i + • • • + a r nX n = 0 

Hence we have the two possibilities: 

(i) r = n. Then the system has only the zero solution. 

(ii) r <n. Then the system has a nonzero solution. 

If we begin with fewer equations than unknowns then, in echelon form, r < n and 
hence the system has a nonzero solution. That is, 
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Theorem 2.3: A homogeneous system of linear equations with more unknowns than 
equations has a nonzero solution. 

Example 2.8: The homogeneous system 

x + 2y - 3z + w = 0 
x - Sy + z -2w = 0 
2x + y - 3z + 5w = 0 
has a nonzero solution since there are four unknowns but only three equations. 

Example 2.9: We reduce the following system to echelon form: 

x + y - z = 0 x + y - z = 0 x + y - z = 0 

2x - Sy + z - 0 -5y + 3z = 0 -5j/ + 3z = 0 

x - 4y + 2z = 0 -5y + 3z = 0 

The system has a nonzero solution, since we obtained only two equations in the 
three unknowns in echelon form. For example, let z = 5; then y = 3 and x - 2. 
In other words, the 3-tuple (2, 3, 5) is a particular nonzero solution. 

Example 2.10: We reduce the following system to echelon form: 

x + y - z = o x + y - z = 0 x + y - z = 0 

2x + 4y- z = 0 2{/+z = 0 2y + z = 0 

3* + 2y + 2z = 0 —y + 5z = 0 llz = 0 

Since in echelon form there are three equations in three unknowns, the system has 
only the zero solution (0, 0, 0). 



Solved Problems 

SOLUTION OF LINEAR EQUATIONS 

2x - 3y + 6z + 2v - 5w = 3 
2.1. Solve the system: y - 4z + v = 1 . 

v-Sw = 2 

The system is in echelon form. Since the equations begin with the unknowns x,y and v re- 
spectively, the other unknowns, z and w, are the free variables. 

To find the general solution, let, say, z = a and w = 6. Substituting into the third equation, 

v - 36 = 2 or v = 2 + 36 

Substituting into the second equation* 

y - 4a + 2 + 36 = 1 or y = 4a - 36 - 1 

Substituting into the first equation, 

2a;-3(4a-36-l) + 6a + 2(2 + 36)- 56 = 3 or x = 3a - 56 - 2 

Thus the general solution of the system is 

x = 3a - 56 - 2, y = 4a - 36 - 1, z = a, v = 2 + 36, w = 6 
or (3a -56 -2, 4a -36-1, a, 2 + 36, 6), where a and 6 are arbitrary real numbers. Some texts 
leave the general solution in terms of the free variables z and w instead of a and 6 as follows: 



CHAP. 2] 



LINEAR EQUATIONS 



25 



x = 3z — 5w — 2 

y = 4z - 3w - 1 or (3z - 5w - 2, 42 - 3w - 1, z, 2 + 3w, w) 
v - 2 + Sw 

After finding the general solution, we can find a particular solution by substituting into the 
general solution. For example, let a = 2 and 6 = 1; then 

x = -1, y = 4, z = 2, v = 5, w = 1 or (-1, 4, 2, 5, 1) 

is a particular solution of the given system. 

X + 2y-3z = -1 
2.2. Solve the system: 3x - y + 2z = 7 . 

5x + Sy - 4z = 2 

Reduce to echelon form. Eliminate x from the second and third equations by the operations 
L 2 -» — 31,! + L 2 and L 3 -» — 5Lj + L 3 : 

-31,!: -3* - 6j/ + 9z = 3 -5Li: -5a; - lOy + Voz = 5 

L 2 : 3* - y + 2z = 7 L 3 : 5* + 3j/ - 4z = 2 



-31,! + £ 2 : + llz = 10 -5Li + L 3 : -7j/ + llz = 7 

Thus we obtain the equivalent system 

x + 2y - 3z = -1 

-1y + llz = 10 

-7y + llz = 7 

The second and third equations show that the system is inconsistent, for if we subtract we obtain 
0* + Oy + Qz = 3 or 0 = 3. 

2x+ y - 2z = 10 
2.3. Solve the system: Sx + 2y + 2z = 1 . 

5x + 4y + 3z= 4 

Reduce to echelon form. Eliminate x from the second and third equations by the operations 
L 2 -» -3Lj + 2L 2 and L 3 -* -6L, + 2L 3 : 

-3Z,!: -6* - 3j/ + 6z = -30 -5£i= -10a; - 5j/ + lOz = -50 

2Z, 2 : 6a; + 4y + 4z = 2 2L 3 : 10a; + Sy + 6z = 8 



-31,! + 2L 2 : y + lOz = -28 -5Lj + 2L 3 : Zy + 16z = -42 

Thus we obtain the following system from which we eliminate y from the third equation by the 
operation L 3 -» — 3L 2 + L 3 : 

2x + y- 2z = 10 2x + y- 2z - 10 

y + lOz = -28 to 2/ + lOz = -28 

3j/ + 16z = -42 -14z - 42 

In echelon form there are three equations in the three unknowns; hence the system has a unique 
solution. By the third equation, z = —3. Substituting into the second equation, we find y = 2. 
Substituting into the first equation, we obtain * = 1. Thus x = 1, y = 2 and z = -3, i.e. the 3-tuple 
(1, 2, —3), is the unique solution of the system. 
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x + 2y-Zz = 6 
2.4. Solve the system: 2x - y + 4z = 2 . 

4x + Zy-2z = 14 

Reduce the system to echelon form. Eliminate * from the second and third equations by the 
operations L 2 -* — 2L t + L 2 and L a -» —AL t + L s : 

-2Lj: -2x -4y + 6z = -12 -4L X : -4x - %y + 12z = -24 

L 2 : 2* - y + 4z = 2 L 3 : 4a; + 3y - 2z = 14 



-5j/ + lOz = -10 -5j/ + lOz = -10 

or y - 2z = 2 or 3/ - 2z = 2 



* + 2j/ - 3z = 6 
j/ — 2z = 2 



Thus the system is equivalent to 

x + 2y - 3z = 6 

y — 2z = 2 or simply 
j/ — 2z = 2 

(Since the second and third equations are identical, we can disregard one of them.) 

In echelon form there are only two equations in the three unknowns; hence the system has an 
infinite number of solutions and, in particular, 3-2 = 1 free variable which is z. 

To obtain the general solution let, say, z - a. Substitute into the second equation to obtain 
y = 2 + 2a. Substitute into the first equation to obtain x + 2(2 + 2a) - 3a = 6 or x = 2 - a. 
Thus the general solution is 

x = 2- a, y = 2 + 2a, z = a or (2 - a, 2 + 2a, a) 
where a is any real number. 

The value, say, a = 1 yields the particular solution x = 1, y = 4, z - 1 or (1,4, 1). 



x - Zy + 42 — 2w = 5 
2.5. Solve the system: 2y + 5z + w = 2 . 

y-Zz =4 

The system is not in echelon form since, for example, y appears as the first unknown in both 
the second and third equations. However, if we rewrite the system so that w is the second unknown, 
then we obtain the following system which is in echelon form: 

x — 2w — Zy + 4z = 5 

w + 2y + 5z = 2 

y — 3z = 4 

Now if a 4-tuple (a, b, e, d) is given as a solution, it is not clear if 6 should be substituted for 
w or for y; hence for theoretical reasons we consider the two systems to be distinct. Of course this 
does not prohibit us from using the new system to obtain the solution of the original system. 

Let z = a. Substituting into the third equation, we find y-A + Za. Substituting into the 
second equation, we obtain w + 2(4 + 3o) + 5a = 2 or w = -6 - 11a. Substituting into the first 

equation, » - 2(-6 - llo) - 3(4 + 3o) + 4o = 5 or x = 5 - 17a 

Thus the general solution of the original system is 

x = 5 - 17a, y = 4 + 3a, z = a, w = -6 - 11a 
where a is any real number. 
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2.6. Determine the values of a so that the following system in unknowns x, y and z has: 
(i) no solution, (ii) more than one solution, (iii) a unique solution: 

x + y — z — 1 

2x + 3y + az = 3 

x + ay + 3z = 2 

Reduce the system to echelon form. Eliminate x from the second and third equations by the 
operations L 2 -* — 2L X + L 2 and L 3 -» —I,! + L 3 : 

-2Li: -2» - 2j/ + 2z = -2 — L,: -x - y + z = -1 

L 2 : 2x + Sy + az = 3 L 3 : a; + ay + Sz = 2 



y + (a + 2)z= 1 (a-l)j/ + 4z= 1 

Thus the equivalent system is 

a; + y — z = 1 

2/ + (o + 2)« = 1 
(a- 4z = 1 

Now eliminate y from the third equation by the operation L 3 -> —(a — 1)L 2 + L 3 , 
-(a - 1)L 2 : -(a - l)y + (2 - a - a?)z = 1 - a 
L 3 : (a- 1)2/ + 4z = 1 

(6 - a - a 2 )z = 2 - a 
or (3 + a)(2 — a)z = 2 - a 

to obtain the equivalent system 

x + y — z = 1 

y + (a + 2)z = 1 

(3 + o)(2 - o)z = 2 - a 

which has a unique solution if the coefficient of z in the third equation is not zero, that is, if a ¥= 2 
and a ¥= —3. In case a = 2, the third equation is 0 = 0 and the system has more than one solu- 
tion. In case a = —3, the third equation is 0 = 5 and the system has no solution. 

Summarizing, we have: (i) a = —3, (ii) a = 2, (iii) a ¥> 2 and a ¥= —3. 



2.7. Which condition must be placed on a, b and c so that the following system in unknowns 
x, y and z has a solution? 

x + 2y — 3z = a 
2x + 6y - Hz = b 
x — 2y + Iz — c 

Reduce to echelon form. Eliminating x from the second and third equation by the operations 
L 2 -* — 2L 1 + L 2 and L 3 -» — L t + L 3 , we obtain the equivalent system 

x + 2y — 3z = a 

22/ — 5z = b — 2a 

—Ay + lOz = c — a 

Eliminating 2/ from the third equation by the operation L z -> 2L 2 + I/ 3 , we finally obtain the 
equivalent system 

x + 2y — 3z = a 

2j/ - 5z = 6 - 2a 

0 - c + 26 - 5o 
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The system will have no solution if the third equation is of the form 0 = k, with k ¥° 0; that is, 
if c + 26 - 5a # 0. Thus the system will have at least one solution if 

c + 26 — 5a = 0 or 5a = 26 + c 
Note, in this case, that the system will have more than one solution. In other words, the system 
cannot have a unique solution. 



HOMOGENEOUS SYSTEMS OF LINEAR EQUATIONS 

2.8. Determine whether each system has a nonzero solution: x + 2y- z = 0 

x - 2y + 3z - 2w = 0 x + 2y - 3z = 0 2x + 5y + 2z = 0 

3a: _ rjy _ 2z + Aw = 0 2x + 5y + 2z = 0 x + Ay + lz = 0 

Ax + Sy + 5z + 2w = 0 Sx - y - 4z = 0 a; + 3?/ + 32 = 0 

(i) (H) (*") 

(i) The system must have a nonzero solution since there are more unknowns than equations. 

(ii) Reduce to echelon form: 

x + 2y-3z = 0 x + 2j/-3 2 = 0 x + 2y - Sz = 0 

2* + 5y + 2z = 0 to V + 8z = 0 to 2/ + 8z = 0 

3a; - y - 42 = 0 -1y + 5z = 0 61z = 0 

In echelon form there are exactly three equations in the three unknowns; hence the system has 

a unique solution, the zero solution. 

(iii) Reduce to echelon form: 

x + 2y - z = 0 x + 2y - z = 0 

2a; + 5y + 22 = 0 y + 4z = 0 x + 2y - z = 0 

x + % + lz = 0 2y + 8z = 0 y + 4z = 0 

x + 3j/ + 3z = 0 j/ + 4z = 0 

In echelon form there are only two equations in the three unknowns; hence the system has a 
nonzero solution. 

2.9. The vectors Ui,...,Um in, say, R" are said to be linearly dependent, or simply 
dependent, if there exist scalars fci, . . . , km, not all of them zero, such that 
fetti + • • • + kmUm = 0. Otherwise they are said to be independent. Determine 
whether the vectors u, v and w are dependent or independent where: 

(i) u = (1, 1, -1), v = (2, -3, 1), w = (8, -7, 1) 

(ii) u = (1, -2, -3), v = (2, 3, -1), w = (3, 2, 1) 

(iii) u = (at, a 2 ), v - (6i, b 2 ), w = (ci, c 2 ) 

In each case: 

(a) let xu + yv + zw = 0 where x, j/ and z are unknown scalars; 
(6) find the equivalent homogeneous system of equations; 

(c) determine whether the system has a nonzero solution. If the system does, then the vectors are 
dependent; if the system does not, then they are independent. 

(i) Let xu + yv + zw = 0: 

x(l, 1, -1) + 1/(2, -3, 1) + z(8, -7, 1) = (0, 0, 0) 

or (*, », -*) + (2j/, -By, y) + (8z, -It, z) = (0, 0, 0) 

or (x + 2y + 8z, x-Zy-lz, -x + y + z) = (0,0,0) 
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Set corresponding components equal to each other and reduce the system to echelon form: 
x + 2y + 8z = 0 x + 2y+8z = 0 x + 2y + Sz = 0 x + 2y + Sz = 0 

x-Zy -7z = 0 -5y - 15z = 0 y + 3z = 0 y + 3z = 0 

-x+ y + z = 0 3j/+9z = 0 y + Sz = 0 

In echelon form there are only two equations in the three unknowns; hence the system has a 
nonzero solution. Accordingly, the vectors are dependent. 

Remark: We need not solve the system to determine dependence or independence; we only need 
to know if a nonzero solution exists. 

(ii) x(l, -2, -3) + y(2, 3, -1) + z(3, 2, 1) = (0, 0, 0) 
(x, -2x, -3x) + (2y, 3y, -y) + (3z, 2z, z) = (0, 0, 0) 

{x + 2y + 3z, -2x + 3y + 2z, -3x - y + z) = (0, 0, 0) 

x + 2y + 3z = 0 x + 2y+3z-Q x + 2y + 3z = 0 

-2x + 3y + 2z = 0 7y + Sz = 0 7y + 8z = 0 

-3x - y + z = 0 5# + lOz = 0 30z = 0 

In echelon form there are exactly three equations in the three unknowns; hence the system has 
only the zero solution. Accordingly, the vectors are independent. 

(iii) x(a u a 2 ) + y(b u b 2 ) + z(c 1( c 2 ) = (0, 0) 

fax, a 2 x) + (6,|/, btf) + (00, e 2 z) = (0, 0) and so 0l * + hlV + c i 2 ~ 0 

a 2 x + 6 2 3/ + c 2 z = 0 

+ b t y + c x z, a 2 x + b 2 y + c 2 z) — (0, 0) 

The system has a nonzero solution by Theorem 2.3, i.e. because there are more unknowns than 
equations; hence the vectors are dependent. In other words, we have proven that any three 
vectors in R 2 are dependent. 



2.10. Suppose in a homogeneous system of linear equations the coefficients of one of the 
unknowns are all zero. Show that the system has a nonzero solution. 

Suppose x u . . . , x n are the unknowns of the system, and Xj is the unknown whose coefficients 
are all zero. Then each equation of the system is of the form 

a 1 ar 1 + • • • + a^x^ + Ox, + a j + i x j + 1 + • • • + a n x n = 0 

Then for example (0, . . .,0, 1, 0, . . .,0), where 1 is the ;th component, is a nonzero solution of each 
equation and hence of the system. 



MISCELLANEOUS PROBLEMS 

2.11. Prove Theorem 2.1: Suppose u is a particular solution of the homogeneous system 
(*) and suppose W is the general solution of the associated homogeneous system (**). 

^ en u + W = {u + w: w G W} 

is the general solution of the nonhomogeneous system (*). 

Let U denote the general solution of the nonhomogeneous system (*). Suppose u G U and that 
u = (% u n ). Since u is a solution of (*), we have for i = 1, . . .,ra, 

a ix u x + a i2 u 2 + - • ■ + a in u n = b t 

Now suppose w 6 W and that w = (»„ . . .,«;„). Since w is a solution of the homogeneous system 
(**), we have for i = 1, . . . , m, 

Oi^x + a i2 w 2 + • • • + a in w n - 0 
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Therefore, for i = 1, . . . , m, 

+ w l) + *i2< M 2 + w 2> + - " " + a in( u n + w n) 

= a a u t + a il w 1 + a i2 U2 + a i2 w 2 + • • • + a in u n + a }n w n 

= (ajitti + aj2«2 H 1- a in u„) + (a n Wi + a i2 w 2 + ■•■ + a ln w n ) 

= bi + Q = bt 

That is, u + w is a solution of (*). Thus it + w e U, and hence 

it + W c U 

Now suppose v = . . . , v J is any arbitrary element of 17, i.e. solution of (*). Then, for 
i = l, ...,m, 

+ °i2 v 2 + • • - + OiA = & i 

Observe that v = We claim that v-uGW. For t = 1, ...,m, 

- M l) + a i2( v 2 - «2) + ' * * + «»>(•»« - "n) 

= (ail^l + «t2"2 H + a in v n) ~ ( a il u l + a i2 u 2 + ' - " + a in M n) 

= = 0 

Thus v - u is a solution of the homogeneous system (*), i.e. v-uGW. Then vGu+W, and hence 

U C u + W 

Both inclusion relations give us U = u + W; that is, u+W is the general solution of the 
nonhomogeneous system (**). 

2.12. Consider the system (*) of linear equations (page 18). Multiplying the ith equation 
by d, and adding, we obtain the equation 

(CiOu + • • • + CmOmi)Xi + • • ■ + (Cidm + • • • + C m Omn)Xn = Cibi + • • • + Cmbm (I) 

Such an equation is termed a linear combination of the equations in (*). Show that 
any solution of (*) is also a solution of the linear combination (1). 
Suppose u — (fe lt . . .,fc„) is a solution of (*). Then 

a u ftj + a i2 k 2 + • • • + a in k„ = 6;, t = 1, . . . , w (2) 

To show that tt is a solution of (J), we must verify the equation 

(c^n + • • • + c m a ml )fe x + • • • + (c t a ln + • • • + c m a mn )k n = + • • • + c m b m 

But this can be rearranged into 

eitenfci + • • • + a ln k„) + ■■■ + c m (a ml + ■■■ + a mn k n ) = + • • • + e m b m 

or, by (2), Ci&i + • • • + c m b m = cfa + • • • + c m b m 

which is clearly a true statement. 

2.13. In the system (*) of linear equations, suppose an ¥= 0. Let (#) be the system ob- 
tained from (*) by the operation Li-> -OuLi + auU, i¥-l. Show that (*) and (#) 
are equivalent systems, i.e. have the same solution set. 

In view of the above operation on (*), each equation in (#) is a linear combination of equations 
in (*); hence by the preceding problem any solution of (*) is also a solution of (#). 

On the other hand, applying the operation L t -* — {-a^ + L t ) to (#), we obtain the origi- 
nal system (*). That is, each equation in (*) is a linear combination of equations in (#); hence each 
solution of (#) is also a solution of (*). 

Both conditions show that (*) and (#) have the same solution set. 
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2.14. Prove Theorem 2.2: Consider a system in echelon form: 

duXi + a\iX% + CL13X3 + + atnXn = bi 

02J 2 £Cj 2 + a,2,j 2 +lXj 2 + l + + (tinXn = &2 

drj r Xj r + Or.j r + lX3 r +i + • • • + CirnXn = b r 

where 1< j t < ■ ■ • <j T and where an ¥> 0, a 2i2 . . ., a rir ¥* 0. The solution is as 
follows. There are two cases: 

(i) r = n. Then the system has a unique solution. 

(ii) r<n. Then we can arbitrarily assign values to the n — r free variables and 
obtain a solution of the system. 

The proof is by induction on the number r of equations in the system. If r = 1, then we have 
the single linear equation 

a t Xi + 02*2 + a A x 3 + " • • + o, n x n = °> where ¥> 0 

The free variables are x 2 , . . . , x n . Let us arbitrarily assign values to the free variables; say, 
x 2 = &2, *8 — ■ • • » x n — *V Substituting into the equation and solving for x lt 

*i = -7- (6 - o 2 fe 2 - <hfrs - ■■■ - a n k n ) 

These values constitute a solution of the equation; for, on substituting, we obtain 

«i ^ (b - a 2 k 2 — • • • - a n k n ) J + + • • • + a n k n = b or b = b 

which is a true statement. 

Furthermore if r = n = 1, then we have ax = b, where a ^ 0. Note that * = b/a is a solu- 
tion since a(6/a) = 6 is true. Moreover if x = k is a solution, i.e. ofe = 6, then A = b/a. Thus 
the equation has a unique solution as claimed. 

Now assume r > 1 and that the theorem is true for a system of r — 1 equations. We view the 
r — 1 equations 

a 2i 2 x ) 2 + *2,j 2 + i«i 2 + i + + a 2n x n - b 2 



a rj r x i r + a r,j r +\ x j r \\ + • • • + a rn x n = b r 

as a system in the unknowns x j2 *„. Note that the system is in echelon form. By induction 

we can arbitrarily assign values to the (n - j 2 + 1) - (r - 1) free variables in the reduced system 
to obtain a solution (say, x >2 = k^, x n — k n ). As in case r = 1, these values and arbitrary 
values for the additional ? 2 — 2 free variables (say, x 2 = k 2 , . .., » 3 - 2 -i = fcj 2 -i). yield a solution 
of the first equation with 

»! = — (6j — a 12 k 2 - ■ ■ • — a ln k„) 

(Note that there are (n — j 2 + 1) — (r — 1) + (j 2 — 2) = n — r free variables.) Furthermore, these 
values for x u . . . , x„ also satisfy the other equations since, in these equations, the coefficients of 
»j 2 -i are zero. 

Now if r = n, then j 2 = 2. Thus by induction we obtain a unique solution of the subsystem 
and then a unique solution of the entire system. Accordingly, the theorem is proven. 

2.15. A system (*) of linear equations is defined to be consistent if no linear combination 
of its equations is the equation 

Ozi + 0X2 + ■ ■ ■ + 0x„ = b, where b ¥■ 0 (1) 

Show that the system (*) is consistent if and only if it is reducible to echelon form. 

Suppose (*) is reducible to echelon form. Then it has a solution which, by Problem 2.12, is a 
solution of every linear combination of its equations. Since (1) has no solution, it cannot be a linear 
combination of the equations in (*). That is, (*) is consistent. 

On the other hand, suppose (*) is not reducible to echelon form. Then, in the reduction process, 
it must yield an equation of the form (1). That is, (1) is a linear combination of the equations in (*). 
Accordingly (*) is not consistent, i.e. (*) is inconsistent. 
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Supplementary Problems 

SOLUTION OF LINEAR EQUATIONS 

2x + Sy = 1 2* + 4j/ = 10 4* - 2j/ = 5 

2.16. Solve: (i) " «i) (iii) 

5x + 7j/ = 3 V ' 3x + 6y = 15 v ' -6x + 3?/ = 1 



2.17. Solve: 

2x + y - 3« = 5 2x + 3j/ - 2« = 5 x + 2j/ + 32 = 3 

(i) 3* - 2j/ + 22 = 5 (ii) x - 2y + 3z = 2 (iii) 2x + 3y + 82 = 4 

5x - 3y - 2 = 16 4* - y + Az = 1 3x + 2y + 172 = 1 



2.18. Solve: 

2x + 3y = 3 x + 2y - 32 + 2w = 2 x + 2j/ - 2 + 3w - 3 

(i) a; - 2y = 5 (ii) 2x + 6y - 82 + 6w> = 5 (iii) 2s + Ay + 42 + 3w = 9 

3x + 2y = 7 3a; + Ay - 5z + 2w = 4 3x + 6j/ - 2 + 8w = 10 



x + 2y + 22 = 



x + 5y + 4z - 13w = 3 



3x — 2y - 2 = 5 

2.19. Solve: (i) 2x _ 5y + 3z = _ 4 TO 3x - „ + 22 + Bw = 2 

2x + 2y + 3z - Aw = 1 

as + 4y + 62 = 0 



2.20. Determine the values of k such that the system in unknowns x, y and 2 has: (i) a unique solution, 
(ii) no solution, (iii) more than one solution: 

kx + y + 2 = 1 

x + 2y + kz = 1 

(o) x + ky + z = 1 (6) 

2x + ky + 82 = 3 

a; + y + kz = 1 



2.21. Determine the values of k such that the system in unknowns x, y and 2 has: (i) a unique solution, 
(ii) no solution, (iii) more than one solution: 

x+ y + kz - 2 x -32 = -3 

(o) 3x + Ay + 2z-k (b) 2x + ky - 2 = -2 

2x + 3y — z = 1 x + 2y + kz — 1 



2.22. Determine the condition on a, b and c so that the system in unknowns *, y and 2 has a solution: 

x + 2y — 32 = a x — 2y + Az = a 

(i) 3x - y + 2z = b (ii) 2x + Sy - 2 = 6 

x — by + 82 = c 3a: + y + 22 = e 



HOMOGENEOUS SYSTEMS 

2.23. Determine whether each system has a nonzero solution: 

x + 3y — 22 = 0 x + 3y — 2z = 0 x + 2y - 5z + Aw = 0 

(i) x - 83/ + 82 = 0 (u) 2x - Zy + z = 0 (iii) 2x - 3y + 2z + 3w = 0 

3x - 2j/ + 42 = 0 3x — 2» + 22 = 0 4x - 7j/ + 2 - 6w = 0 
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2.24. Determine whether each system has a nonzero solution: 

x - 2y + 2z = 0 2x - Ay + Iz + Av - 5w = 0 

2.x + y - 2z = 0 9a; + 3y + 2z - 7v + w = 0 
(i) (ii) 

3a; + Ay - 6z = 0 5a; + 2y - 3z + v + Zw = 0 

3x - llj/ + 12z = 0 6a; - 5y + Az - Sv - 2w = 0 



2.25. Determine whether the vectors u, v and w are dependent or independent (see Problem 2.9) where: 

(i) u = (1, 3, -1), v = (2, 0, 1), m; = (1, -1, 1) 

(ii) u = (1, 1, -1), v = (2, 1, 0), w = (-1, 1, 2) 

(iii) m = (1, -2, 3, 1), v = (3, 2, 1, -2), w = (1, 6, -5, -A) 

MISCELLANEOUS PROBLEMS 

2.26. Consider two general linear equations in two unknowns x and y over the real field R: 

ax + by = e 
ex + dy = f 

Show that: 

(i) if ^ 2 , i-e. if ad - be ^ 0, then the system has the unique solution x = de , - b J , 
c a ad - be' 

= af-ee, 
y ad-be' 



(ii) if ^ = ^ * J> tn «n the system has no solution; 

a b e 

(iii) if — = ^ = J> then the system has more than one solution. 



2.27. Consider the system ax + by = 1 

ex + dy = 0 

Show that if ad — be ¥° 0, then the system has the unique solution x = d/(ad — be), y = —e/(ad — be). 
Also show that if ad - be = 0, c ¥■ 0 or d ¥> 0, then the system has no solution. 



2.28. Show that an equation of the form 0x t + 0x 2 + • • • + 0x„ = 0 may be added or deleted from a 
system without affecting the solution set. 



2.29. Consider a system of linear equations with the same number of equations as unknowns: 

a 11 x 1 + a 12 x 2 + • • • + a ln x n = 6 t 

021X1 + °22*2 + • * * + 0 2 „X„ = 6 2 

a nl Xi + o B2 x 2 + • • • + o nB x„ = 6„ 

(i) Suppose the associated homogeneous system has only the zero solution. Show that (1) has a 
unique solution for every choice of constants b { . 

(ii) Suppose the associated homogeneous system has a nonzero solution. Show that there are 
constants b t for which (I) does not have a solution. Also show that if {1) has a solution, then 
it has more than one. 
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Answers to Supplementary Problems 

2.16. (i) * = 2, y = -1; (ii) x = 5 - 2a, y = a; (iii) no solution 

fx = -1-7 

2.17. (i) (1,-3,-2); (ii) no solution; (iii) (-1 - 7a, 2 + 2a, a) or i = g + & 

2.18. (i) as = S, y = -1 , , „ 

fx = — z + 2w 

(ii) (-a + 26,1 + 2a -26, a, 6) or = x + 2z _ 2w 

fa; = 7/2 - 5w/2 - 2^/ 

(iii) (7/2 - 56/2 - 2a, a, 1/2 + 6/2, 6) or <j = 1/g + w/2 

2.19. (i) (2,1,-1); (ii) no solution 

2.20. (a) (i) fe »* 1 and fe # -2; (ii) fc = -2; (iii) fc = 1 

(6) (i) never has a unique solution; (ii) fc = 4; (iii) fe ¥= 4 

2.21. (o) (i) fe ^ 3; (ii) always has a solution; (iii) fe = 3 
(6) (i) fe ^ 2 and fe # -5; (ii) fe = -5; (iii) fe = 2 

2.22. (i) 2o - 6 + e = 0. (ii) Any values for a, 6 and c yields a solution. 

2.23. (i) yes; (ii) no; (iii) yes, by Theorem 2.3. 

2.24. (i) yes; (ii) yes, by Theorem 2.3. 

2.25. (i) dependent; (ii) independent; (iii) dependent 
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INTRODUCTION 

In working with a system of linear equations, only the coefficients and their respective 
positions are important. Also, in reducing the system to echelon form, it is essential to 
keep the equations carefully aligned. Thus these coefficients can be efficiently arranged in 
a rectangular array called a "matrix". Moreover, certain abstract objects introduced in 
later chapters, such as "change of basis", "linear operator" and "bilinear form", can also 
be represented by these rectangular arrays, i.e. matrices. 

In this chapter, we will study these matrices and certain algebraic operations defined on 
them. The material introduced here is mainly computational. However, as with linear 
equations, the abstract treatment presented later on will give us new insight into the 
structure of these matrices. 

Unless otherwise stated, all the "entries" in our matrices shall come from some arbitrary, 
but fixed, field K. (See Appendix B.) The elements of K are called scalars. Nothing essen- 
tial is lost if the reader assumes that K is the real field R or the complex field C. 

Lastly, we remark that the elements of R" or C" are conveniently represented by "row 
vectors" or "column vectors", which are special cases of matrices. 



MATRICES 

Let K be an arbitrary field. A rectangular array of the form 



where the a« are scalars in K, is called a matrix over K, or simply a matrix if K is implicit. 
The above matrix is also denoted by (ay), i = 1, . . . , to, / = 1, . . . , n, or simply by (a«). 
The to horizontal n-tuples 



are its columns. Note that the element Oi,-, called the ij-entry or ij-component, appears in 
the ith row and the jth column. A matrix with to rows and n columns is called an to by n 
matrix, or to x n matrix; the pair of numbers (to, n) is called its size or shape. 




(an, a 12 , . . . , ain), (a 2 i, a 22 , . . . , a2n), . . . 
are the rows of the matrix, and the n vertical TO-tuples 
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Example 3.1: The following is a 2 X 3 matrix 

Its rows are (1, —3, 4) and (0, 5, —2); its columns are 

Matrices will usually be denoted by capital letters A,B, and the elements of the 

field K by lower case letters a, b, Two matrices A and B are equal, written A = B, if 

they have the same shape and if corresponding elements are equal. Thus the equality of 
two m x n matrices is equivalent to a system of mn equalities, one for each pair of elements. 

Example 3.2: The statement ( x + y 2z + w \ _ /* 5 \ is equ i va i e nt to the following system 

\x-y z-wj \1 4/ 

of equations: 

x + y = Z 
x — y =1 
2z + w = 5 
z — w = 4 

The solution of the system is * = 2, y = 1, z = 3, w = — 1. 

Remark: A matrix with one row is also referred to as a row vector, and with one column 
as a column vector. In particular, an element in the field K can be viewed as 
a 1 x 1 matrix. 



MATRIX ADDITION AND SCALAR MULTIPLICATION 

Let A and B be two matrices with the same size, i.e. the same number of rows and of 
columns, say, m x n matrices: 

6m 
&2n 





(an 


ttl2 


. am \ 






(bu 


bi2 




am 


Ci22 


. din 


and 


B = 




ba 




\dml 


ttm2 


. . (tmn t 






\ bml 


bm2 



The sum of A and B, written A+B,is the matrix obtained by adding corresponding entries: 



A + B = 



I an + bu «i2 + biz ... Om + bm 

021 + b21 «22 + b22 ... ^2n + b2n 



\ Oml + bml Om2 + bm2 



Omn ~t~ bi 



The product of a scalar fc by the matrix A, written k • A or simply fcA, is the matrix obtained 
by multiplying each entry of A by k: 

kan kau . . . kai n \ 
ka2i ka.22 • • • ka2i 



kA = 

\ ka m l kOm2 ■ ■ ■ kOm 

Observe that A+B and kA are also mxn matrices. We also define 

-A = -1-A and A — B — A + (-B) 
The sum of matrices with different sizes is not defined. 
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Then 



Example 3.3: Let A = f 1 2 3 \ and B = ( 3 ° 

\4 5 -6/ \^-7 1 

/1 + 3 -2 + 0 3 + 2N _ / 4 -2 5N 
V4-7 5 + 1-6 + 8/ - \-Z 6 2^ 

/3«1 3 »(-2) 3»3 \ _ / 3 -6 9\ 
^3-4 3-5 3-(-6)y (^12 15 -18/ 

2A - 3B = ( 2 ~ 4 6 ^ + (-* ° " 6N ) = ^ OX 

^8 10 -12/ V21 -3 -24/ ^ 29 7 -36/ 

Example 3.4: The mXn matrix whose entries are all zero, 

10 0 ... 0 
0 0 ... 0 



i0 0 ... 0 



is called the zero matrix and will be denoted by 0. It is similar to the scalar 0 in 
that, for any m X n matrix A = (a 4j ), A + 0 = (ay + 0) = (a tj ) = A. 

Basic properties of matrices under the operations of matrix addition and scalar multi- 
plication follow. 

Theorem 3.1 : Let V be the set of all m X n matrices over a field K. Then for any matrices 
A,B,C GV and any scalars k u fc 2 £ K, 

(i) (A+B) + C = A + (B + C) (v) k^A + B) = &iA + k t B 

(ii) A + 0 = A (vi) (ki + h)A = kiA + k%A 

(iii) A + (-A) = 0 (vii) (fc,fe)A = k^kiA) 

(iv) A+B = B + A (viii) 1-A = A and 0A = 0 
Using (vi) and (viii) above, we also have that A + A = 2A, A + A+A = 3A, ... . 

Remark: Suppose vectors in R n are represented by row vectors (or by column vectors); 

u — (01, a 2 , . . . , a„) and v = (61, & 2 , . . . , &„) 

Then viewed as matrices, the sum u + v and the scalar product ku are as follows: 

u + v = (ai + b u a 2 + & 2 , . . . , a„ + b n ) and few = (fed, ka 2 , . .., ka„) 

But this corresponds precisely to the sum and scalar product as defined in 
Chapter 1. In other words, the above operations on matrices may be viewed 
as a generalization of the corresponding operations defined in Chapter 1. 



MATRIX MULTIPLICATION 

The product of matrices A and B, written AB, is somewhat complicated. For this 
reason, we include the following introductory remarks. 

(i) Let A = (Oi) and B = belong to R", and A represented by a row vector and B by a 
column vector. Then their dot product A • B may be found by combining the matrices 
as follows: 

lbl\ 

A'B = (oi, 02, . . ., On) I * J = eti&i + a 2 & 2 + • • • + a n b n 

w 

Accordingly, we define the matrix product of a row vector A by a column vector B as 
above. 
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buXi + 612*2 + 613*3 = yi 

(ii) Consider the equations (1) 

&21*1 + &22*2 + &23«3 = 2/2 

This system is equivalent to the matrix equation 

'bu 6 12 bls \h\ = /yA orgimply BX=y 
621 622 bizj\ x \ \y2j 

where B — (6«), X = (xt) and Y = (yi), if we combine the matrix B and the column 
vector X as follows: 

BX ( bn bl2 hl3 \l Xl \ ( bnxi + bizx 2 + bi S x 3 \ ■_ fBi-X 

\6 2 1 6 2 2 b2a )\xJ \ 621*1 + 622X2 + 623*3/ \Bs'X 

where Si and B 2 are the rows of B. Note that the product of a matrix and a column 
vector yields another column vector. 

dnVi + ai2#2 = zi 

(iii) Now consider the equations (2) 

chiyi + 0222/2 = z% 

which we can represent, as above, by the matrix equation 

-Y*) = («) or simply AY = Z 

O21 0122/ V^V V 212 / 

where A = (o«), Y = (#i) as above, and Z = (z s ). Substituting the values of yi and 2/2 
of (1) into the equations of (2), we obtain 

an(6ii*i + 612*2 + 613*3) + (112(621*1 + 622*2 + 623*3) = Zi 

021(611*1 + 612*2 + 613*3) + 022(621*1 + 622*2 + 623*3) = Zi 

or, on rearranging terms, 

(an6n + ai2&2i)*i + (011&12 + ai2&22)*2 + (on&is + oi2&23)*3 = Zi 

(3) 

((l2l6ll + 022&2l)*l + (O21612 + a22&22)*2 + (021&13 + <l22&23)*3 = Z 2 

On the other hand, using the matrix equation BX = Y and substituting for Y into 
AY = Z, we obtain the expression 

ABX = Z 

This will represent the system (3) if we define the product of A and B as follows: 

Oil Ol2 V 611 612 613 \ / fflll&H + 012&21 O11612 + (I12622 011613 + 012623 



AB — 

021 022/V 621 622 623/ \ 021611 + 022621 021612 + 022622 021613 + 022623 

Ai-B 1 Ai-B 2 Ai-B 3 
A2-B 1 A 2 -B 2 A2-B 3 

where Ai and A 2 are the rows of A and B X ,B 2 and B 3 are the columns of B. We em- 
phasize that if these computations are done in general, then the main requirement is 
that the number of yi in (1) and (2) must be the same. This will then correspond to the 
fact that the number of columns of the matrix A must equal the number of rows of 
the matrix B. 
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With the above introduction, we now formally define matrix multiplication. 

Definition: Suppose A = (a«) and B = (&«) are matrices such that the number of columns 
of A is equal to the number of rows of B; say, A is an m x p matrix and B is a 
pxn matrix. Then the product AB is the m x n matrix whose y-entry is 
obtained by multiplying the ith row A { of A by the yth column B' of B: 



AB 



/ Ai-B 1 Ai'B 2 ... Ai-£" 
Ai-B 1 Ai-B 2 ... A 2 -B n 



\Am-B 1 A m 'B 2 



A m 'B"j 



That is, 

/flu 



ai P 



\a m i . . . a mp / \& P i 
where cy = aubij + a i2 &2j + 



.. 


... &i„\ 




/Cu 


& 


. . . &pn/ 




\Cml 



Cm 



— 2 a «£^fcj- 
k = l 



We emphasize that the product AB is not defined if A is an m x p matrix and B is a 
9Xm matrix, where p ¥> q. 



Example 3.5: 



Example 3.6: 



(r &\(a^ 02 o 3 \ /mi + sfej ra 2 + sb 2 ra 3 + sb 3 \ 

\t uj\b 1 6 2 &3/ ytaj + wd! to 2 + m& 2 * a 3 + m6 3 / 

/l 2\/l 1\ _ /1-1 + 2-0 1-1 + 2-2N _ n 5\ 
\3 4/\0 2/ ~ \3'l + 4-0 3«l + 4«2/ ~ (3 11/ 

/1-1 + 1-3 l-2 + l«4\ /4 6\ 

^0-1^2-3 0 • 2 + 2 • 4 / ~ ( 6 8/ 



1 1 

0 2 



1 2 
3 4 



The above example shows that matrix multiplication is not commutative, i.e. the products 
AB and BA of matrices need not be equal. 

Matrix multiplication does, however, satisfy the following properties: 

Theorem 3.2: (i) {AB)C = A(BC), (associative law) 

(ii) A(B + C) = AB + AC, (left distributive law) 

(iii) (B + C)A — BA + CA, (right distributive law) 

(iv) k(AB) = (kA)B = A(kB), where A; is a scalar 

We assume that the sums and products in the above theorem are defined. 
We remark that OA = 0 and 50 = 0 where 0 is the zero matrix. 



TRANSPOSE 

The transpose of a matrix A, written A', is the matrix obtained by writing the rows of 
A, in order, as columns: 

/an «2i ... 

ffll2 0.22 ■ . ■ 



an 


tti2 


. Om 


a 2 i 


a 2 2 


a 2n 


ttml 


Om2 





Observe that if A is an m x n matrix, then A % is annxm matrix. 
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Example 3.7: 

The transpose operation on matrices satisfies the following properties: 
Theorem 3.3: (i) (A+B)* = A* + B* 

(ii) (A*)* = A 

(iii) (kAy — kA*, for k a scalar 

(iv) (ABy = BW 




MATRICES AND SYSTEMS OF LINEAR EQUATIONS 

The following system of linear equations 

auXl + d\lX 2 + • • • + dmXn = &1 
a 2 lXi + a-LiXl + • • • + CiznXn — b 2 



(1) 



OmlXi + a m 2«2 + 

is equivalent to the matrix equation 



+ dmnXn — &t> 



an 


ai2 


. am \ 






<t 


a2i 


tt22 


din \ 1 


x% 






Oml 


CLm2 


. (Imn/ 


w 







or simply AX = B 



(2) 



where A = (ay), X = (xt) and B = (b t ). That is, every solution of the system {1) is a 
solution of the matrix equation (2), and vice versa. Observe that the associated homogeneous 
system of (1) is then equivalent to the matrix equation AX = 0. 

The above matrix A is called the coefficient matrix of the system (1), and the matrix 



/an an 

«21 «22 



dm 
d%n 



bi 

b 2 



is called the augmented matrix of (1). Observe that the system (1) is completely determined 
by its augmented matrix. 



Example 3.8: 



The coefficient matrix and the augmented matrix of the system 

2x + 3y - 4z = 7 
x - 2y - 5z = 3 
are respectively the following matrices: 

(l J Z\) ™* 
Observe that the system is equivalent to the matrix equation 

'2 



-4 
-5 



G -i =:)(:) ■ ( 



In studying linear equations it is usually simpler to use the language and theory of 
matrices, as indicated by the following theorems. 



CHAP. 3] 



MATRICES 



41 



Theorem 3.4: Suppose Ui,U2, ...,Un are solutions of a homogeneous system of linear 
equations AX = 0. Then every linear combination of the of the form 
kmi + k 2 U2 + • ■ ■ + knUm where the fa are scalars, is also a solution of 
AX = 0. Thus, in particular, every multiple ku of any solution u of 
AX — 0 is also a solution of AX — 0. 

Proof. We are given that Aui - 0, Au 2 = 0, . . . , Attn = 0. Hence 

A(kui + ku 2 + • • • + ku n ) — faAui + kzAuv + • • • + knAiu 

- feiO + k 2 0 + • • • + fc„0 = 0 
Accordingly, faui + ■ ■ ■ + k n Un is a solution of the homogeneous system AX = 0. 

Theorem 3.5: Suppose the field K is infinite (e.g. if K is the real field R or the complex 
field C). Then the system AX = B has no solution, a unique solution or 
an infinite number of solutions. 

Proof. It suffices to show that if AX = B has more than one solution, then it has 
infinitely many. Suppose u and v are distinct solutions of AX = B; that is, Au = B and 
Av = B. Then, for any k G K, 

A(u + k(u-v)) = Au + k(Au-Av) = B + k(B-B) = B 

In other words, for each k E K, u + k(u - v) is a solution of AX = B. Since all such solu- 
tions are distinct (Problem 3.31), AX-B has an infinite number of solutions as 
claimed. 



ECHELON MATRICES 

A matrix A = (an) is an echelon matrix, or is said to be in echelon form, if the number 
of zeros preceding the first nonzero entry of a row increases row by row until only zero 
rows remain; that is, if there exist nonzero entries 

o>u v <*2j 2 , . . . , a r j T , where ji < fa < ■ • ■ < h 

with the property that 

an — 0 for i — r,j< % and for i > r 
We call a\j v . . ., a T j r the distinguished elements of the echelon matrix A. 

Example 3.9: 



The following are echelon matrices where the distinguished elements have been 
circled: 

2 
0 
0 



2 
0 

0 



3 
0 
0 
0 



2 

© 
0 

0 



0 4 

1 -3 
0 0 
0 0 



5 -e\ 

2 0 
0 2 
0 0 



0 
0 



/I© 

0 0 
0 0 

\o 0 



0 0 4 0 \ 

0 0-3 0 

0 0 2 0 

0 0 0 0/ 



In particular, an echelon matrix is called a row reduced echelon matrix if the dis- 
tinguished elements are: 

(i) the only nonzero entries in their respective columns; 

(ii) each equal to 1. 

The third matrix above is an example of a row reduced echelon matrix, the other two are 
not. Note that the zero matrix 0, for any number of rows or of columns, is also a row 
reduced echelon matrix. 



ROW EQUIVALENCE AND ELEMENTARY ROW OPERATIONS 

A matrix A is said to be row equivalent to a matrix B if B can be obtained from A by a 
finite sequence of the following operations called elementary row operations: 
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[Ei]: Interchange the ith row and the jth row: R, Rj. 

[E 2 ]: Multiply the ith row by a nonzero scalar k: R { -» kR,, k¥=0. 

[E s ]: Replace the ith row by k times the jth row plus the ith row: Ri -> kRj + Ri. 

In actual practice we apply [E 2 ] and then [E 3 ] in one step, i.e. the operation 
[E]: Replace the ith row by k' times the jth row plus k (nonzero) times the ith row: 
Ri -* k'Rj + kRi, k ¥> 0. 

The reader no doubt recognizes the similarity of the above operations and those used 
in solving systems of linear equations. In fact, two systems with row equivalent aug- 
mented matrices have the same solution set (Problem 3.71). The following algorithm is 
also similar to the one used with linear equations (page 20). 

Algorithm which row reduces a matrix to echelon form: 

Step 1. Suppose the ji column is the first column with a nonzero entry. Inter- 
change the rows so that this nonzero entry appears in the first row, that is, 
so that aijj ¥■ 0. 

Step 2. For each i > 1, apply the operation 

Ri ~* —Ui^Ri + ttijj-Ri 

Repeat Steps 1 and 2 with the submatrix formed by all the rows excluding the first. 
Continue the process until the matrix is in echelon form. 

Remark: The term row reduce shall mean to transform by elementary row operations. 

Example 3.10: The following matrix A is row reduced to echelon form by applying the operations 
R 2 -* —2R X + R 2 and R 3 -* — ZR t + R s , and then the operation R 3 ■* -5R 2 + 4R 3 : 






/I 


2 


-3 


o\ 




/I 


2 


-3 




to 


0 


0 


4 




-1 


0 


0 


4 


i) 




1° 


0 


5 


3/ 




\° 


0 


0 





Now suppose A = (o^) is a matrix in echelon form with distinguished elements 
aij v . . ., a T j T . Apply the operations 

Rk -» -a kh Ri + (hjltk, k = 1, . . . , i - 1 
for i = 2, then i = 3, . .., i-r. Thus A is replaced by an echelon matrix whose dis- 
tinguished elements are the only nonzero entries in their respective columns. Next, multiply 
Ri by a" 1 , i^r. Thus, in addition, the distinguished elements are each 1. In other words, 
the above process row reduces an echelon matrix to one in row reduced echelon form. 

Example 3.11: On the following echelon matrix A, apply the operation R x -* —4H 2 + 3Ri and then 
the operations Ri -» R$ + Ri and R 2 ~* — Bfl 3 + 2R 2 : 

/2 3 4 5 6\ /6 9 0 7 -2\ /6 9 0 7 0^ 

A= 0 0 3 2 5 J to 0 0 3 2 5 to 0 0 6 4 0 
\0 0 0 0 2/ \0 0 0 0 2/ \0 0 0 0 2/ 

Next multiply Ri by 1/6, R 2 by 1/6 and R 3 by 1/2 to obtain the row reduced echelon 

matrix , . 

'l 3/2 0 7/6 0^ 

0 0 12/3 0 
k 0 0 0 0 l y 

The above remarks show that any arbitrary matrix A is row equivalent to at least one 
row reduced echelon matrix. In the next chapter we prove, Theorem 4.8, that A is row 
equivalent to only one such matrix; we call it the row canonical form of A. 
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SQUARE MATRICES 

A matrix with the same number of rows as columns is called a square matrix. A square 
matrix with n rows and n columns is said to be of order n, and is called an n-square matrix. 
The diagonal (or: main diagonal) of the n-square matrix A = (ay) consists of the elements 

Oil, tt22, . . . , ftnn. 

Example 3.12: The following is a 3-square matrix: 

Its diagonal elements are 1, 5 and 9. 

An upper triangular matrix or simply a triangular matrix is a square matrix whose 
entries below the main diagonal are all zero: 




'(111 Ol2 
0 ffl22 



ai„\ 

<l2n 



0 0 



or 



a„„/ 



/ an ai 2 

tt22 
\ 



Similarly, a lower triangular matrix is a square matrix whose entries above the main 
diagonal are all zero. 

A diagonal matrix is a square matrix whose non-diagonal entries are all zero: 



jai 0 ... 








0 a 2 ... 


0 


or 


a 2 


\o 0 ... 


a„/ 




\ 



' On/ 

In particular, the n-square matrix with l's on the diagonal and O's elsewhere, denoted by /„ 
or simply /, is called the unit or identity matrix; e.g., 

'l 0 0^ 



h = 



0 10 

10 0 1, 



This matrix I is similar to the scalar 1 in that, for any n-square matrix A, 

AI = IA = A 

The matrix kl, for a scalar k e K, is called a scalar matrix; it is a diagonal matrix whose 
diagonal entries are each k. 



ALGEBRA OF SQUARE MATRICES 

Recall that not every two matrices can be added or multiplied. However, if we only 
consider square matrices of some given order n, then this inconvenience disappears. Specif- 
ically, the operations of addition, multiplication, scalar multiplication, and transpose can be 
performed on any n x n matrices and the result is again an n x n matrix. 

In particular, if A is any n-square matrix, we can form powers of A: 

A 2 = AA, A 3 = A 2 A, '. . . and A" — I 

We can also form polynomials in the matrix A: for any polynomial 

f(x) = a 0 + aix + 02X 2 + ■ ■ ■ + a n x n 
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where the a 4 are scalars, we define /(A) to be the matrix 

/(A) = aj + diA + a,2A 2 + • • • + a„A n 
In the case that /(A) is the zero matrix, then A is called a zero or root of the polynomial f{x). 



Example 3.13: Let A = ^3 — 4) > then A * = (3 



2\ /l 2\ _ / 7-6 
3 -4 7 ( 3 -4 / ~ t -9 22 



If /(as) = 2a;2 - 3a; + 5, then 



«> - •(-' 2) - «(i -!) ♦ il °) - (-U t) 



If «r(x) = x* + 3x- 10, then 



- (4 ^ ) + »(i .:) - »(i ?) ■ c :) 



Thus A is a zero of the polynomial g(x). 



INVERTIBLE MATRICES 

A square matrix A is said to be invertible if there exists a matrix B with the property 
that 

AB = BA = I 

where / is the identity matrix. Such a matrix B is unique; for 

ABi = BiA = I and AB 2 = B2A = / implies Bi = BJ = Bi(AB 2 ) = {BiA)B z = IB 2 = B 2 

We call such a matrix B the inverse of A and denote it by A -1 . Observe that the above 
relation is symmetric; that is, if B is the inverse of A, then A is the inverse of B. 



Example 3.14: 



c :)(-: 
( 



3 -5\ /6-5 -10 + 10N f\ 0 

2/ ~ (3-3 -5 + 6 j U 1 



3 -5\/2 5\ _ / 6-5 15-15\ /l 0 

-1 2] 1 3/ _ 1-2 + 2 -5 + 6 ) " U 1 



/2 5\ /3-5\ 
Thus ( j g ) and f J are invertible and are inverses of each other. 

We show (Problem 3.37) that for square matrices, AB — I if and only if BA — I; hence 
it is necessary to test only one product to determine whether two given matrices are in- 
verses, as in the next example. 

-11 2 2\ 
Example 3.15: (2-1 3 II -4 0 1 




2 + 0 


-2 


2 + 0 


-2\ 




0 




4 + 0 


-3 


4-1 


-3 


-6 


1 


3 


8 + 0 


-8 


8 + 1 


-8/ 




0 





6-1-1/ 

Thus the two matrices are invertible and are inverses of each other. 

fa b\ 

We now calculate the inverse of a general 2x2 matrix A — [ J. We seek scalars 
x, y, z, w such that \ ' 

a v\ _ / 1 0\ /ax + bz ay + bw\ _ / 1 0 

c d)\z w J ~ I 0 1/ ° r \cx + dz cy + dw J \0 1 
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which reduces to solving the following two systems of linear equations in two unknowns: 

(ax + bz = 1 jay + bw = 0 

cx +dz = 0 \cy + dw = 1 

If we let |A| = ad -be, then by Problem 2.27, page 33, the above systems have solutions if 
and only if |A| ¥= 0; such solutions are unique and are as follows: 

_ d _ d_ _ —b _ ^b_ _ — c _ — c _ a _ a 
x ~ ad - be ~ \A\ ' V ~ ad -be ~ \A\ ' Z ~ ad - be ~ \A\' W ~ ad-bc~ \A\ 

Accordingly A- = ( d/|A| ~ b/|A| ^ = M * ~ b \ 

Accordingly, ^_ c/(A | a /| A | J \ A \{-e a) 

Remark: The reader no doubt recognizes \A\ = ad — bc as the determinant of the matrix 
A; thus we see that a 2 x 2 matrix has an inverse if and only if its determinant 
is not zero. This relationship, which holds true in general, will be further 
investigated in Chapter 9 on determinants. 



BLOCK MATRICES 

Using a system of horizontal and vertical lines, we can partition a matrix A into smaller 
matrices called blocks (or: cells) of A. The matrix A is then called a block matrix. Clearly, 
a given matrix may be divided into blocks in different ways; for example, 



/I 


-2 


0 


1 


3\ 




/I 


- 2 


0 


1 j 3\ 


2 


3 


5 


7 


-2 




2 


3 


5 


7 j-2 


\» 


1 


4 


5 


9/ 




\3 


1 


4 


5 { 9/ 



!l -2 0 j 1 



3 
1 



47 5 



3' 
7 -2 
9/ 



The convenience of the partition into blocks is that the result of operations on block matrices 
can be obtained by carrying out the computation with the blocks, just as if they were the 
actual elements of the matrices. This is illustrated below. 

Suppose A is partitioned into blocks; say 



A = 



An 


A12 . 


. A 


A 2 i 


A22 . 


. A 


Ami 


Am2 


. A 



Multiplying each block by a scalar k, multiplies each element of A by k; thus 

. . . kAm \ 
. . . kA 2l 





fkAu 


feAl 2 


kA = 1 


kA 2 i 


kA 2 2 




\kA m i 


kA m 2 



Now suppose a matrix B is partitioned into the same number of blocks as A; say 

/ Bn B12 
B21 B22 



B = 



\B„ 
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Furthermore, suppose the corresponding blocks of A and B have the same size. Adding 
these corresponding blocks, adds the corresponding elements of A and B. Accordingly, 



A + B 



'An + Bn 
A21 + B21 



A12 + B12 
A22 4- B22 



Aln + Bin 

A2n + Bin 



Ami + Bml Am2 + B, 



m2 



+ B 



The case of matrix multiplication is less obvious but still true. That is, suppose matrices 
U and V are partitioned into blocks as follows 



U = 



U12 
U22 



Ui P \ 

u 2t 



ml 



u mp J 



and 



V = 



fV u 7 12 
V21 V22 



Vm\ 
V 2 



\V P i V22 



V V nj 



such that the number of columns of each block U ik is equal to the number of rows of each 
block V kj . Then 

IWn W12 ... 
W21 W22 ... 



UV = 



where 



\Wmi Wm2 

Wa = UiiVu + Ui2V2i + 




The proof of the above formula for UV is straightforward, but detailed and lengthy. It 
is left as a supplementary problem (Problem 3.68). 



Solved Problems 

MATRIX ADDITION AND SCALAR MULTIPLICATION 
3.1. Compute: 

'1 2-3 4\ /3 -5 6 -1' 
0-5 1 — 1 J U 0-2-3 



(i) 



.... /I 2 -3\ , /3 5\ ..... n /l 2-3 

(U) (o-4 l) + (l- 2 J (1U) - 3 ( 4 -5 6 

(i) Add corresponding entries: 



/l 2 -3 4\ /3 -5 6 -1\ 
\0 -5 1 -l) + \2 0 -2 -ZJ 

/1 + 3 2- 5 -3 + 6 4 - 1N /4 -3 3 3\ 
^0 + 2 -5 + 0 1-2-1-3/ ~ 1,2-5-1-4/ 



(ii) The sum is not defined since the matrices have different shapes. 

(iii) Multiply each entry in the matrix by the scalar —3: 



/l 2 -3\ / -3 -6 9\ 

~ 3 \4 -5 6/ ~ V— 12 15 -18/ 
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** ™ A = (3 0 -!)' 5 = (J -1 1)' C = (I — 1 -l)- Find 3A + 4B-2C. 

First perform the scalar multiplication, and then the matrix addition: 

„, , „„ „„ /6 -15 3\ /4 -8 -12\ / 0 -2 4\ /10 -25 -5\ 
3A + 4B - 2C = ( g Q _ 12 ) + ( 0 _ 4 20 ) + (_ 2 2 2 ) = ( 7 _ 2 10 ) 

_ lx y\ I x 6 \ / 4 # + «\ 

3.3. Finda;,2/,zandwif 3( * = ( , _ +( , _ * . 

First write each side as a single matrix: 

/Zx 3j/\ / a; + 4 a; + i/ + 6\ 

\3z 3^/ ~ \z + w - 1 2m> + 3 / 

Set corresponding entries equal to each other to obtain the system of four equations, 

3x = x + 4 2x = 4 

3y = x + y + 6 2y = 6 + x 
or 

3z = 2 + w — 1 22 = 10 — 1 

Zw = 2tu + 3 w = 3 

The solution is: x — 2, j/ = 4, z = 1, to = 3. 

3.4. Prove Theorem 3.1(v): Let A and B be rnxn matrices and k a scalar. Then 
k(A+B) = kA + kB. 

Suppose A = (ay) and B = (6 i3 ). Then a tj + 6y is the #-entry of A + B, and so k(a i} + 6 W ) 
is the ij'-entry of fc(A + B). On the other hand, ka tj and kby are the i;'-entries of kA and fcB respec- 
tively and so ka tj + kby is the y-entry of kA + kB. But k, a tj and 6 ( , are scalars in a field; hence 

fc(Ojj + b { j) = ka-ij + kb ijr for every i, j 

Thus k(A + B) = kA + kB, as corresponding entries are equal. 

Remark: Observe the similarity of this proof and the proof of Theorem l.l(v) in Problem 1.6, page 
7. In fact, all other sections in the above theorem are proven in the same way as the 
corresponding sections of Theorem 1.1. 

MATRIX MULTIPLICATION 

3.5. Let (r x s) denote a matrix with shape rxs. Find the shape of the following products 
if the product is defined: 

(i) (2x3)(3x4) (iii) (1 x 2)(3 x 1) (v) (3 x 4)(3 x 4) 

(ii) (4xl)(lx2) (iv) (5x2)(2x3) (vi) (2x2)(2x4) 

Recall that an m X p matrix and a q X n matrix are multipliable only when p = q, and then 
the product is an m X n matrix. Thus each of the above products is denned if the "inner" numbers 
are equal, and then the product will have the shape of the "outer" numbers in the given order. 

(i) The product is a 2 X 4 matrix. 

(ii) The product is a 4 X 2 matrix. 

(iii) The product is not denned since the inner numbers 2 and 3 are not equal. 

(iv) The product is a 5 X 3 matrix. 

(v) The product is not denned even though the matrices have the same shape. 

(vi) The product is a 2 X 4 matrix. 
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3.6. Let A = 



1 3 

2 -1 



and B = 



2 0-4 
3-2 6 



Find (i) AB, (ii) BA. 



(i) Since A is 2 X 2 and B is 2 X 3, the product AB is denned and is a 2 X 3 matrix. To obtain the 
entries in the first row of AB, multiply the first row (1, 3) of A by the columns (^j , ^ 
, ^ of B, respectively: 



and 



(!-!)( 



0 -4 



3 -2 



- ( 



1-2 + 3-3 1-0 + 3- (-2) l.(-4) + 3.6 r 



2 + 9 0-6 -4 + 18 



11 -6 14 



) 



To obtain the entries in the second row of AB, multiply the second row (2, —1) of A by the 
columns of B, respectively: 

/ 1 3 \ / 2 0 -4 \ /ll -6 14 \ 

U -1 A3 -2 9 J U-2 + (-D-3 2.0+(-l).(-2) 2 • (-4) + (-1) • %) 

Th- AB = [l 2 -1 4 ) 



(ii) Note that B is 2 X 3 and A is 2 X 2. Since the inner numbers 3 and 2 are not equal, the product 
BA is not defined. 



3.7. Given A = (2,1) and B = ^ 5 ' find (*) AjB ' (") BA - 

(i) Since A is 1 X 2 and B is 2 X 3, the product AB is defined and is a 1 X 3 matrix, i.e. a row 
vector with 3 components. To obtain the components of AB, multiply the row of A by each 
column of B: 

AB = (£,!)( J ~l 4) = (2- 1 + 1. 4,2. (-2) + 1 • 5,2- 0 + 1- (-3)) = (6,1,-3) 

(ii) Note that B is 2 X 3 and A is 1 X 2. Since the inner numbers 3 and 1 are not equal, the product 
BA is not defined. 




3.8. Given A = \ 1 0| and B = ('J J , find (i) AB, (ii) BA. 

(i) Since A is 3 X 2 and B is 2 X 3, the product AB is defined and is a 3 X 3 matrix. To obtain the 
first row of AB, multiply the first row of A by each column of B, respectively: 

-3 -4-4 -10 + 0\ 1-1 -8 -10^ 




To obtain the second row of AB, multiply the second row of A by each column of B, 
respectively: 

(aMai|(||: -\) - (1+0 -«"*• -»''•) - ( ; -« -') 

To obtain the third row of AB, multiply the third row of A by each column of B, respectively: 
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)(I 'I'D- 



-1 


-8 


-10 \ 






-8 




1 


-2 




1 ■ 1 




-2 




3 + 12 


6 + 16 


15 + oy 




\ 9 


22 


15/ 




Thus AB = 



(ii) Since B is 2 X 3 and A is 3 X 2, the product i?A is defined and is a 2 X 2 matrix. To obtain the 
first row of BA, multiply the first row of B by each column of A, respectively: 

2 -1 \ 

1 0 = /2-2 + 15 -1 + 0-20N _ /15 -21 \ 
To obtain the second row of BA, multiply the second row of B by each column of A, respectively: 

) J * = (e + 4 + o -3 + 0 + 0) = (10 -3) 




1 -2 -5 



1 —3- 4 

Z 15 -21 \ 

ThUS BA = (lO - 3 ) 



Remark: Observe that in this case both AB and BA are defined, but they are not equal; in fact they 
do not even have the same shape. 



3.9. Let A = 



2-10 
1 0-3 



and 




(i) Determine the shape of AB. (ii) Let c« denote the element in the ith row and 
jth column of the product matrix AB, that is, AB = (dj). Find: C23, Cu and C21. 

(i) Since A is 2 X 3 and B is 3 X 4, the product AB is a 2 X 4 matrix. 

(ii) Now c y is defined as the product of the ith row of A by the ith column of B. Hence: 



C23 = (1, 0, -3) 



= 1 • 0 + 0 • 3 + (-3) • (-2) = 0 + 0 + 6 = 6 



c, 4 = (2,-1,0) 



= 2*1 + (-!)•(-!) + 0«0 = 2 + 1 + 0 = 



c 21 = (1, 0, -3) 



= 1 « 1 + 0 • 2 + (-3) • 4 = 1 + 0-12 = -11 



MO. Compute: ,., ( .J _J) (Hi, J) (v, 9.-^ 

<«> ( i !Y I) w (1)^ 



-3 5/\-7/ v ' \6y 

(i) The first factor is 2 X 2 and the second is 2 X 2, so the product is defined and is a 2 X 2 matrix: 
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/ 1 6\/4 0\ / 1*4 + 6-2 l*0 + 6*(-l) \ / 16 -6 \ 

\-3 5 A 2 -1/ V(-3)*4 + 5*2 (-3)-0 + 5 • (-1)7 ~ ^-2 -5 J 

(ii) The first factor is 2 X 2 and the second is 2 X 1, so the product is defined and is a 2 X 1 matrix: 

( 1 6 V 2 \ ( 1,2 + 6, (- 7 ) \ _ /-40\ 

V-3 V(-3)*2 + 5-(-7)y 

(iii) Now the first factor is 2 X 1 and the second is 2X2. Since the inner numbers 1 and 2 are 
distinct, the product is not defined. 

(iv) Here the first factor is 2 X 1 and the second is 1 X 2, so the product is defined and is a 2 X 2 
matrix: 



G)« ■ G:.* i::) - Gs J) 



(v) The first factor is 1 X 2 and the second is 2 X 1, so the product is defined and is a 1 X 1 matrix 
which we frequently write as a scalar. 

(2.~l)(_g) = (2*1 + (-1)* (-6)) = (8) = 8 

3.11. Prove Theorem 3.2(i): (AB)C = A(BC). 

Let A = (ay), B = (6 Jfc ) and C = (c k ,). Furthermore, let AB — S — (s ik ) and BC = T = (t n ). 
Then 

m 

*Ut = a nf>ik + ai2*2k + • • • + a im b mk = 2 Oyfcjk 

71 

*jl = *J1«11 + &j2«2I + • • • + & jn c n , = 2 b jk c M 

k = l 

Now multiplying S by C, i.e. (AB) by C, the element in the ith row and ith column of the matrix 
(AB)C is 



n m 



»il c ll + S i2 C 2l + • • • + 8 in C„i = 2 S,k«kl =22 («tfM C M 

k = l k = l j = l 

On the other hand, multiplying A by T, i.e. A by JSC, the element in the ith row and Ith column 
of the matrix A(BC) is 

m m n 

a n t u + a a t 2l + ■ ■ ■ + a im t ml = 2 =22 a«(&jfcCki) 

i=i >=i k=i 

Since the above sums are equal, the theorem is proven. 

3.12. Prove Theorem 3.2(ii): A(B + C) = AB + AC. 

Let A = (a y ), B = (b jk ) and C = (c Jk ). Furthermore, let D - B + C = (d )k ), E = AB = (e ik ) 
and F = AC - (/ jk ). Then 

djk = &jk + c jk 

m 

6ik = a il&lk + <»«2&2k + • • • + a>imt>mk = 2 «y6jk 

771 

fik ~ «il c lk + «i2 c 2k + • • • + aimC mfc = 2 "ijCjk 

j=l 

Hence the element in the ith row and feth column of the matrix AB + AC is 

wi m m 

«ik + Ak = 2 «iA-k + 2 aijC,'k = 2 «jj(&jk + cjk) 
1=1 i=i i=i 

On the other hand, the element in the ith row and fcth column of the matrix AD = A(B + C) is 

771 Ttt 

Oil^lk + Uildzk + • • • + <>-imd mk = 5 a i) d jk = .2 a«(&jk + Cjk) 
Thus A(B + C) = AB + AC since the corresponding elements are equal. 
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TRANSPOSE h o 

3.13. Find the transpose A* of the matrix A - 2 3 



\4 4 4 4 



Rewrite the rows of A as the columns of A': A t = 




3.14. Let A be an arbitrary matrix. Under what conditions is the product AA* defined? 

Suppose A is an m X n matrix; then 4» is uX m. Thus the product AA* is always denned. 
Observe that A*A is also defined. Here AA* is an m X m matrix, whereas A*A is an it X n matrix. 



3.15. Let A = (I °\. Find (i) AA*, (ii) A*A. 

To obtain A', rewrite the rows of A as columns: A* = 2 —1 
/i o n\ / 1 3 \ 

AA' = 




A 'A = 



G -> !)| 

/ l«l + 2'2 + 0'0 1*8 + 2-(-l) + 0-4 \ 
U * 1 + (-1) '2 + 4-0 3 • 3 + (-1) • (-1) + 4-4/ 

G -i :) 

1-1 + 3-3 1 • 2 + 3 • (-1) 1 • 0 + 3 • 4 
2-1 + (-1) -3 2 • 2 + (-1) • (-1) 2-0 + (-1) -4 
0-1 + 4-3 0-2 + 4* (-1) 0-0 + 4-4 




Then 



(l 2e) 




3.16. Prove Theorem 3.3(iv): (AS)' = B*A t . 

Let A = (oy) and B = (b jk ). Then the element in the ith row and jth column of the matrix 
AB is 

(*,!&!, + a i2 6 2j + • • • + a^b^ (1) 
Thus (7) is the element which appears in the j'th row and ith column of the transpose matrix (AB)*. 
On the other hand, the jth row of B* consists of the elements from the jth column of B: 

(6 1} b 2i ... 6 mj ) (2) 
Furthermore, the ith column of A' consists of the elements from the ith row of A: 



w/ 



(8) 



Consequently, the element appearing in the ith row and ith column of the matrix B*A* is the 
product of (2) by (S) which gives (1). Thus (AB)* = B*A*. 
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ECHELON MATRICES AND ELEMENTARY ROW OPERATIONS 
3.17. Circle the distinguished elements in each of the following echelon matrices, 
are row reduced echelon matrices? 

/l 2-3 0 1 
0 0 5 2 -4 
\0 0 0 7 

The distinguished elements are the first nonzero entries in the rows; hence 
jQ 2 -8 0 l\ / o © 7 -6 <>\ /© 0 5 0 





0 



0 
0 




® 

o/ 



©20 
0 0 © 



Which 



An echelon matrix is row reduced if its distinguished elements are each 1 and are the only nonzero 
entries in their respective columns. Thus the second and third matrices are row reduced, but the 
first is not. 



/I -2 3 -1\ 

3.18. Given A = 2 -1 2 2 . (i) Reduce A to echelon form, (ii) Reduce A to row 
\8 1 2 8/ 
canonical form, i.e. to row reduced echelon form. 

(i) Apply the operations R 2 -* ~2Ri + R 2 and R 3 -» -32?! + R 3 , and then the operation 
R 3 -» —7R 2 + SR 3 to reduce A to echelon form: 

/l -2 3 -l\ /l -2 3 -l\ 

A to 0 3-4 4 to 0 3-4 4 
\0 7 -7 6/ \0 0 7 -10/ 

ii) Method 1. Apply the operation R t -» 2R 2 + 3R V and then the operations «i -» -R 3 + IRi 
and R 2 -» 4i? 3 + 7R 2 to the last matrix in (i) to further reduce A: 

(Z 0 1 B\ /21 0 0 45 \ 

to 0 3 -4 4 to 0 21 0-12 
\0 0 7 -10/ \ 0 0 7 -10/ 

Finally, multiply R x by 1/21, R 2 by 1/21 and R s by 1/7 to obtain the row canonical form of A: 

jl 0 0 15/7 \ 

0 1 0 -4/7 
\0 0 1 -10/7/ 

Method 2. In the last matrix in (i), multiply R 2 by 1/3 and R 3 by 1/7 to obtain an echelon 
matrix where the distinguished elements are each 1: 




Now apply the operation R 1 -* 2R 2 + R u and then the operations R 2 -» (4/3)fi 3 + R 2 and 
jBj -> (—1/3)R 3 + Ri to obtain the above row canonical form of A. 

Remark: Observe that one advantage of the first method is that fractions did not appear 
until the very last step. 
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3.19. Determine the row canonical form of 



to 




Note that the third matrix is already in echelon form. 




3.20. Reduce A = 



I 6 3 -4\ 

-4 1 -6 to echelon form, and then to row reduced echelon form, 
\ 1 2-5/ 
i.e. to its row canonical form. 

The computations are usually simpler if the "pivotal" element is 1. Hence first interchange the 
first and third rows: 



to 



1 


2 


-6\ 




/I 


2 


~*\ 


4 


1 


-6 


to | 


0 


9 


-26 


6 


3 


-4/ 




\o 


-9 


26/ 




to 



fl 2 -5 \ /l 0 7/9 \ 

0 1 -26/9 to 0 1 -26/9 

\0 0 0 / \0 0 0 / 

Note that the third matrix is already in echelon form. 



3.21. Show that each of the following elementary row operations has an inverse operation 
of the same type. 

[Ei]: Interchange the ith row and the jth row: R t <-> R } . 

[E 2 ]: Multiply the ith row by a nonzero scalar k: R f -» kR u k ¥* 0. 

[E a ]: Replace the ith row by k times the ?'th row plus the ith row: R t -> kR } + R u 

(i) Interchanging the same two rows twice, we obtain the original matrix; that is, this operation 
is its own inverse. 

(ii) Multiplying the ith row by k and then by fc -1 , or by fc- 1 and then by fe, we obtain the original 
matrix. In other words, the operations i? { -» kR { and R { -* are inverses. 

(iii) Applying the operation R { -» kRj + i? 4 and then the operation R t -» -kRj + R 0 or apply- 
ing the operation R { -* —kRj + R t and then the operation R t -» kRj + R it we obtain the orig- 
inal matrix. In other words, the operations R { -* kRj + R t and R t -» — kRj + R { are 
inverses. 



SQUARE MATRICES 

3.22. Let A = ^ _J j . Find (i) A 2 , (ii) A 8 , (iii) f(A), where = 2x s - Ax + 5. 

«•> - - - - (1 -X -3) 

/ 1-1 + 2-4 l'2 + 2-(-3) \ / 9 -4\ 

V4'l + (-3)-4 4-2 + (-3) -(-3)7 ~ [-8 17/ 
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« » - "> - (j -:X4 ») 

/ l-9 + 2-(-8) l-(-4) + 2-17 \ /-7 30\ 

(^4-9 + (-3) '(-8) 4 «(-4) + (-3) -17/ ^ 60 -67/ 

(iii) To find /(A), first substitute A for a; and 57 for the constant 5 in the given polynomial 
f(x) = 2«3 - 4x + 5: 

= « + « = ^ 7,)- 4 G X 1 . 5) 

Then multiply each matrix by its respective scalar: 

/-14 60S / -4 -8\ /B 0\ 
v 120 -134; ^ \ -16 12 J \0 5 J 

Lastly, add the corresponding elements in the matrices: 

/-14-4 + 5 60 - 8 + 0 N = /-13 52N 
~ (,120 - 16 + 0 -134 + 12 + 5/ \ 104 -117/ 

3.23. Referring to Problem 3.22, show that A is a zero of the polynomial g(x) = x 2 + 2x - 11. 

A is a zero of g(x) if the matrix g(A) is the zero matrix. Compute g(A) as was done for /(A), 
i.e. first substitute A for x and 11/ for the constant 11 in g(x) = &+2x- 11: 

,<„, (_» + 4)-n(J J) 

Then multiply each matrix by the scalar preceding it: 

- (°s ro + c -:) + r. -n) 

Lastly, add the corresponding elements in the matrices: 

/ 9 + 2-11 -4 + 4 + 0N _ /0 ON 
~ ^_ 8 + 8 + 0 17- 6- 11 y \0 0j 

Since g(A) = 0, A is a zero of the polynomial g(x). 



3.24. Given A = ^ . Find a nonzero column vector u = such that A« = 3m. 
First set up the matrix equation Am = 3m: 

G -»)© " ' 

Write each side as a single matrix (column vector): 

/ x + 3y\ _ 
V.4* - 3j// 

Set corresponding elements equal to each other to obtain the system of equations (and reduce to 
echelon form): 

x + By = Bx ^ 2x - By = 0 or 2x - By = 0 Qr 2x - Sy = 0 
^x-By ~ By ^ 4x - 6y - 0 0-0 

The system reduces to one homogeneous equation in two unknowns, and so has an infinite number 
of solutions. To obtain a nonzero solution let, say, y = 2; then x = 3. That is, x = 3, y — 2 is a 



U-L SU1UL1U11B. XU LCtlll « Jivmiwu .jvj. .> j .7 ' - 

solution of the system. Thus the vector « = ( 2 ) is nonzero and has the property that Am = 3m. 
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3.25. Find the inverse of 



3 5 



2 3 

Method 1. We seek scalars x, y, z and w for which 

GX-O-GO - ( 



3x + 5z 3y + 5w 
2x + 3z 2y + 3w 



) - G 0 



or which satisfy 



3k + 5z = 1 
2x + 3z — 0 



and 



f3y + 
L2J/ + 



5w = 0 
3w = 1 



The solution of the first system is x — —3, z 
Thus the inverse of the given matrix is 



2, and of the second system is y = 5, w = —3. 



Method 2. We derived the general formula for the inverse A -1 of the 2X2 matrix A 

= \A-\(- e 'I) where l A l = od-bc 
Thus if A = I ~ ~ ) , then \A\ = 9 - 10 = -1 and A' 1 



( 2 3)' 



-C I) 

=-<ri)=(-i-d- 



MISCELLANEOUS PROBLEMS 

3.26. Compute AB using block multiplication, where 



Here A 
Hence 



AB = 




and 



B = 



/I 
4 



2 3 I 1\ 
5 6 I 1 



\o 0 0 1 1/ 

^ where ^, F, G, R, S and T are the given blocks. 

/9 12 15N /8\ + /l\\ 
\19 26 33/ \7 J \0j\ _ 

( 0 0 0) (2) 



9 12 15 4' 
19 26 33 7 
0 0 0 2, 



3.27. Suppose B = (R u R 2 , R n ), i.e. that i? 4 is the ith row of B. Suppose BA is de- 
fined. Show that BA = (RiA, RiA, . . .,R„A), i.e. that RiA is the ith row of BA. 

Let A 1 , A 2 , . . .,A m denote the columns of A. By definition of matrix multiplication, the ith row 
of BA is (Ri •A 1 ,R i 'A 2 , A m ). But by matrix multiplication, R^A - (R t • A 1 , i? 4 • A 2 , 

R { -A m ). Thus the ith row of BA is R { A. 



3.28. Let ei = (0, . . . , 1, . . . , 0) be the row vector with 1 in the ith position and 0 else- 
where. Show that eiA = Ri, the ith row of A. 

Observe that e { is the ith row of I, the identity matrix. By the preceding problem, the ith row 
of I A is ejA. But I A = A. Accordingly, e { A = R it the ith row of A. 

3.29. Show: (i) If A has a zero row, then AB has a zero row. 

(ii) If B has a zero column, then AB has a zero column. 

(iii) Any matrix with a zero row or a zero column is not invertible. 

(i) Let jRj be the zero row of A, and B l , . ..,B n the columns of B. Then the ith row of AB is 

(fl 4 -£i, R r m, Je,-J5») = (0, 0, .... 0) 
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(ii) Let Cj be the zero column of B, and A u . ..,A m the rows of A. Then the jth column of AB is 

Mi-cA /o 

Az-CA = 0 

(iii) A matrix A is invertible means that there exists a matrix A -1 such that A A" 1 = A -1 A — I. 
But the identity matrix J has no zero row or zero column; hence by (i) and (ii) A cannot have 
a zero row or a zero column. In other words, a matrix with a zero row or a zero column cannot 
be invertible. 



3.30. Let A and B be invertible matrices (of the same order). Show that the product 
AB is also invertible and (AS) -1 = 2? _1 A _1 . Thus by induction, (AiA 2 - • - An) -1 = 
An 1 • • ■Ai 1 Ai 1 where the A f are invertible. 

(A£)(£-iA-i) = A(BJ?-i)A-i = A/A"i = AA -i = / 

and (B-U-i)(AB) = B~HA~iA)B = B~UB = B~^B = I 

Thus (AJS)-i = B-iA-i. 



3.31. Let u and v be distinct vectors. Show that, for each scalar k G K, the vectors 
u + k(u — v) are distinct. 
It suffices to show that if 

u + fc|(w-v) = u + k 2 (u-v) U) 
then &! = k 2 . Suppose (1) holds. Then 

k^u-v) = k 2 (u — v) or (fci - k 2 )(u - v) = 0 
Since u and v are distinct, u — v ¥• 0. Hence fe t — fc 2 = 0 and fc x = k 2 . 



ELEMENTARY MATRICES AND APPLICATIONS* 

3.32. A matrix obtained from the identity matrix by a single elementary row operation is 
called an elementary matrix. Determine the 3-square elementary matrices corre- 
sponding to the operations Ri <r> R 2 , R3 ■* —7Rs and R2 -* —3Ri + R2. 



Apply the operations to the identity matrix I 3 = 0 1 0 to obtain 





E, = 1 0 0 , E 2 = 



0 


0\ 




/ 1 


0 




1 


0 




-3 


1 




0 


-7/ 




\ 0 


0 





3^3. Prove: Let e be an elementary row operation and E the corresponding m-square elemen- 
tary matrix, i.e. E-e(I m ). Then for any mxn matrix A, e(A) = EA. That is, the re- 
sult e(A) of applying the operation e on the matrix A can be obtained by multiplying 
A by the corresponding elementary matrix E. 

Let R { be the ith row of A; we denote this by writing A = (R x R m ). By Problem 3.27, if 

B is a matrix for which AB is defined, then AB = (R^B, R m B). We also let 

e t = (0, ...,0,1,0, 0), a -i 

*This section is rather detailed and may be omitted in a first reading. It is not needed except for certain 
results in Chapter 9 on determinants. 
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Here a = i means that 1 is the ith component. By Problem 3.28, e { A = We also remark that 
I = («!, . . ., e m ) is the identity matrix. 

(i) Let e be the elementary row operation R t «-> Rj. Then, for a = i and A — j, 

E = e(I) = (ei ...,«£ ...,•„) 

and e(A) - (R lf ...,£>...,$,..., R m ) 

Thus /\ A . ^ A 

#A = ( ei A, ..., ej A, ..., et A, ...,e m A) = («!,...,«,,...,«, R m ) = e(A) 

(ii) Now let e be the elementary row operation R { -» fc # 0. Then, for a = i, 

£ - e (/) = (e u ...,ke u ...,e m ) and e(A) = [R u . . ., kR t , . . ., R m ) 
Thus EA = ( ei A, ...,ClA, ...,e m A) = (R u ..., fcB i( .... R m ) = e(A) 

(iii) Lastly, let e be the elementary row operation R t -* kRj + R ( . Then, for a = i, 

E = e(7) = (e lt ...,k^+7 it ...,e m ) and e(A) = (R lt . . . , kR^+R^, . . . , RJ 
Using (ke } + e ( )A = k(e } A) + e { A - kR } + R { , we have 

EA = ( ei A, ...,'(ke^+70A, ...,e m A) = («!,...,*;«, + «„...,«„) = e(A) 
Thus we have proven the theorem. 

3.34. Show that A is row equivalent to B if and only if there exist elementary matrices 
Ei,...,E s such that E s • • ■ E2E1A = B. 

By definition, A is row equivalent to B if there exist elementary row operations e lt ...,e„ for 
which «,(• • •(e 2 (« 1 (A)))- • •) = B. But, by the preceding problem, the above holds if and only if 
E t ' • -E^E X A = B where E { is the elementary matrix corresponding to e 4 . 

3.35. Show that the elementary matrices are invertible and that their inverses are also 
elementary matrices. 

Let E be the elementary matrix corresponding to the elementary row operation e: e(7) = E. 
Let e' be the inverse operation of e (see Problem 3.21) and E' its corresponding elementary matrix. 
Then, by Problem 3.33, 

/ = e'(e(/)) = e'E = E'E and / = e(e'(/)) = eE' = EE' 

Therefore E' is the inverse of E. 



3.36. Prove that the following are equivalent: 

(i) A is invertible. 

(ii) A is row equivalent to the identity matrix /. 

(iii) A is a product of elementary matrices. 

Suppose A is invertible and suppose A is row equivalent to the row reduced echelon matrix B. 
Then there exist elementary matrices E lt E 2 , ■ ■ -,E S such that E s - ■ ■E 2 E 1 A = B. Since A is invert- 
ible and each elementary matrix E t is invertible, the product is invertible. But if B ¥* I, then B 
has a zero row (Problem 3.47); hence B is not invertible (Problem 3.29). Thus B = I. In other 
words, (i) implies (ii). 

Now if (ii) holds, then there exist elementary matrices E lt E 2 , ■■■,E S such that 

Es-'-EzE^ = /, and so A = (E s - ■ •E 2 E 1 )~ 1 = E^E^-'-E* 1 

By the preceding problem, the E t 1 are also elementary matrices. Thus (ii) implies (iii). 

Now if (iii) holds (A = E 1 E 2 . . .E s ), then (i) must follow since the product of invertible 
matrices is invertible. 
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3.37. Let A and B be square matrices of the same order. Show that if AB = /, then 
B = A-K Thus AB = I if and only if BA = I. 

Suppose A is not invertible. Then A is not row equivalent to the identity matrix J, and so A 
is row equivalent to a matrix with a zero row. In other words, there exist elementary matrices 

E x E„ such that E s - • -E 2 E X A has a zero row. Hence E s - ■ •E 2 E l AB has a zero row. Accordingly, 

AB is row equivalent to a matrix with a zero row and so is not row equivalent to /. But this con- 
tradicts the fact that AB = J. Thus A is invertible. Consequently, 

B = IB = (A-U)B = A-HAB) = A~V = A -1 



1.38. Suppose A is invertible and, say, it is row reducible to the identity matrix J by the 
sequence of elementary operations e u . . ., e«. (i) Show that this sequence of elemen- 
tary row operations applied to I yields A' 1 , (ii) Use this result to obtain the inverse 

11 0 2\ 
of A = 2 -1 3 . 

\4 1 8/ 

(i) Let E { be the elementary matrix corresponding to the operation e { . Then, by hypothesis and 
Problem 3.34, E n - ■ -EJS X A = /. Thus (E n - • ■E 2 E 1 I)A — I and hence A i = E n ---EJEJ. 
In other words, A -1 can be obtained from I by applying the elementary row operations e 1( . . .,e„. 

(ii) Form the block matrix (A, /) and row reduce it to row canonical form: 



(A, /) = 



to 





0 


2 


(i 


-1 


3 




1 


8 


0 


2 1 


1 


-1 


-1 1 


-2 


0 




-6 



to 




0 


2 ! 


1 


0 


-1 


- 1 ! 


-2 


1 


1 


0 1 


-4 


0 


) 0 


-11 


2 


2 


0 


4 


0 


-1 


-1 


-6 


1 


1 










i) 

















Observe that the final block matrix is in the form (/, B). Hence A is invertible and B is its 



inverse: 



Remark: 




A -1 = 



In case the final block matrix is not of the form (/, B), then the given matrix is not 
row equivalent to / and so is not invertible. 



Supplementary Problems 

MATRIX OPERATIONS 

In Problems 3.39-3.41, let 

!)■ --U'JD- c = 

3.39. Find: (i) A + B, (ii) A + C, (iii) 3A - 4B. 

3.40. Find: (i) AB, (ii) AC, (iii) AD, (iv) BC, (v) BD, (vi) CD. 

3.41. Find: (i) A*, (ii) A*C, (iii) DW, (iv) B*A, (v) DW, (vi) DD*. 
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3.42. Let ei= (1,0,0), e 2 = (0,1,0) and e s = (0,0,1). Given A 
(ii) e 2 A, (iii) e 3 A. 

3.43. Let e t = (0, . . 0, 1, 0, . . ., 0) where 1 is the ith component. Show the following: 

(i) Be*. = Cj, the ith column of B. (By Problem 3.28, e { A = J? { .) 

(ii) If e ; A = e t B for each i, then A = B. 

(iii) If Ae\ = Be\ for each i, then A = B. 

ECHELON MATRICES AND ELEMENTARY ROW OPERATIONS 

3.44. Reduce A to echelon form and then to its row canonical form, where 

/l 2-1 2 l\ /2 3 -2 5 l\ 

(i) A = 2 4 1 -2 3 , (ii) A = 3 -1 2 0 4 . 

\3 6 2-6 5/ \4 -5 6 -5 7/ 




find (i) e x A, 



3.45. Reduce A to echelon form and then to its row canonical form, where 
/l 3 -1 2\ 




0 


1 


3 


-2 


0 


4 


-1 




0 


0 


2 




0 


5 


-3 





3.46. Describe all the possible 2X2 matrices which are in row reduced echelon form. 



3.47. Suppose A is a square row reduced echelon matrix. Show that if A ¥= I, the identity matrix, then 
A has a zero row. 



3.48. Show that every square echelon matrix is upper triangular, but not vice versa. 



3.49. Show that row equivalence is an equivalence relation: 

(i) A is row equivalent to A; 

(ii) A row equivalent to B implies B row equivalent to A; 

(iii) A row equivalent to B and B row equivalent to C implies A row equivalent to C. 
SQUARE MATRICES 

3.50. Let A = Q . (i) Find A2 and A3, (ii) If /(*) = & - 3x2 _ 2 x + 4, find /(A), (iii) If 
g(x) = x 2 - x - 8, find g(A). 

3.51. Let B = ^ jj j . (i) If f(x) = 2x* - Ax + 3, find /(B). (ii) If g(x) = *2 - 4* - 12, find »(B). 
(iii) Find a nonzero column vector u = ( } such that Z?w = 6w. 

w 

3.52. Matrices A and B are said to commute if AB = BA. Find all matrices [ " J which commute 

wlth (o ij- 

3.53. Let A = Q ^ . Find A". 
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3.54. Let A = (J J) and B=Q . 

Find: (i) A + B, (ii) AB, (iii) A 2 and A», (iv) A», (v) /(A) for a polynomial f(x). 

« l* „.(;;). j ■■■ j). »-h *]. .»„«>. 

3.56. Suppose the 2-square matrix B commutes with every 2-square matrix A, i.e. AB = BA. Show that 

g _ /fc 0\ for some sca i ar f e> i >e< B ia & scalar matrix. 
\0 fc/ 

3.57. Let D fc be the m-square scalar matrix with diagonal elements fc. Show that: 

(i) for any m X n matrix A, D k A = fcA; (ii) for any w X m matrix £, = kB. 

3.58. Show that the sum, product and scalar multiple of: 

(i) upper triangular matrices is upper triangular; 

(ii) lower triangular matrices is lower triangular; 

(iii) diagonal matrices is diagonal; 

(iv) scalar matrices is scalar. 



INVERTIBLE MATRICES 

3.59. Find the inverse of each matrix: 



<»(? *)•<«> ai)- 



-1 2 -3\ /2 1 -V 

3.60. Find the inverse of each matrix: (i) 2 1 0 , (ii) 0 2 1 

\ 4 -2 5/ \5 2 -3/ 

/ 1 3 4\ 

3.61. Find the inverse of 3 -1 6 

\-l 5 II 

3.62. Show that the operations of inverse and transpose commute; that is, (A*) -1 = (A -1 )'. Thus, in 
particular, A is invertible if and only if A* is invertible. 

invertible, and what is its inverse? 







0 . 


. 0 


3.63. When is a diagonal matrix A = 


0 




. 0 




\o 


0 . 


• <*n 



3.64. Show that A is row equivalent to B if and only if there exists an invertible matrix P such that 
B = PA. 

3.65. Show that A is invertible if and only if the system AX = 0 has only the zero solution. 
MISCELLANEOUS PROBLEMS 

3.66. Prove Theorem 3.2: (iii) (B + OA = BA + CA; (iv) fc(AB) = (fcA)B = A(fcB), where fc is a scalar. 
(Parts (i) and (ii) were proven in Problem 3.11 and 3.12.) 

3.67. Prove Theorem 3.3: (i) (A + By = At + B*; (ii) (A*)* = A; (iii) ikA)* = fcA*, for & a scalar. 
(Part (iv) was proven in Problem 3.16.) 

3.68. Suppose A = (A ifc ) and B = (B kj ) are block matrices for which AB is defined and the number of 
columns of each block A ik is equal to the number of rows of each block B kj . Show that AB - (C {j ) 
where C i3 - = 2 A ik B kj . 
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3.69. The following operations are called elementary column operations: 
[2? x ]: Interchange the ith column and the j'th column. 

[E 2 ]: Multiply the ith column by a nonzero scalar A;. 

[E 3 ]: Replace the ith column by k times the jth column plus the ith column. 
Show that each of the operations has an inverse operation of the same type. 

3.70. A matrix A is said to be equivalent to a matrix BUB can be obtained from A by a finite sequence 
of operations, each being an elementary row or column operation. Show that matrix equivalence is 
an equivalence relation. 

3.71. Show that two consistent systems of linear equations have the same solution set if and only if their 
augmented matrices are row equivalent. (We assume that zero rows are added so that both aug- 
mented matrices have the same number of rows.) 



Answers to Supplementary Problems 

3.39. (i) "J («) Not defined. (iii) "J ^ 

3.40. (i) Not defined. (iii) (v) 

/-5 -2 4 5\ ,. , / 11 -12 0 -5\ 

(ll) U-3-12 18 J (lV) (-15 5 8 A) (vi) Not defined. 



1 0 
-1 3 

2 4 

3.42. (i) (a 1( a 2 , a 3 , a 4 ) (ii) (b v b 2 , b 3 , b 4 ) (iii) (c lt c 2 , e 3 , c 4 ) 

2 0 

3.44. (i) I 0 0 3 -6 1 I and \ 0 0 1 





2 


-1 


2 




0 


3 


-6 




0 


0 


-6 




3 


-2 


5 




-11 


10 


-15 




0 


0 


0 




3 


-1 






11 


-5 






0 


0 


1 




0 


0 






1 


3 






0 


-13 






0 


0 


35 




0 


0 


0/ 



4 -7 


4 


0 -6 




-3 12 








0 


4/3 


0 


0 



(v) 14 (vi) 




0 0 1 -l/6 y 

l\ /l 0 4/11 5/11 13/11 \ 

5 ) and (0 1 -10/11 15/11 -5/11 

0 0 0 0 / 



3.45. (i) " " " 0 and 



(ii) L" 11 and 



0 


4/11 




13/11 


1 


-5/11 




3/11 


0 


0 




0 


0 


0 




0 


1 


0 


0 




0 


1 


0 


) 


0 


0 






0 


0 


0/ 
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3 46 f° °V f° 1 V or ^\ where k is any scalar. 



'0 1 1\ 

3.48. [ 0 1 1 is upper triangular but not an echelon matrix. 
i0 0 1/ 



MM. (., 4- = (» J). A. - »), /<* - £ J); <«» ^) - (J J) 

3.52. Only matrices of the form ^ commute with ^ 0 . 

/I 2«\ 

3.53. A- = ( 0 j J 

(Hi) A2 = (o !)' A3 = (o 2?) 
/2» ON 



3.54. (i) A + B = 14 °) 
(ii) AB = 



(v) /(A) 



\ o m)) 



3.55. (i) #A = 



/14 ON 
V 0 33/ 

/3a, 
\36, 



3a, 3a 2 
36 2 



3a„ 
36„ 



) 



= 3A (ii) BD 



/3c, 3d,' 
3c 2 3d 2 i 



3£ 



\3c„ SdJ 



3.59. 



/ 5 -2N .... / 1/3 1/3 N 

<»> (_ 7 3 J (U) (-1/9 2/9 j 





/-5 


4 


-3\ 




3.60. (i) 


10 


-7 


6 


1 <«>( 




\ 8 


-6 


5/ 





8 -1 -3 1 
-5 12 
10 -1 -4, 



/31/2 -17/2 -11^ 

3.61. 9/2 -5/2 -3 
\-7 4 5/ 

3.62. Given AA~* = I. Then / = /* = (AA"i)' = (A^AK That is, (A"i)« = (A*)"*. 



- 0 



/ a 

0 a 



3.63. A is invertible iff each a t 0. Then A 1 = ' 2 

10 0 



. 0 



Chapter 4 



Vector Spaces and Subspaces 



INTRODUCTION 

In Chapter 1 we studied the concrete structures B n and C" and derived various proper- 
ties. Now certain of these properties will play the role of axioms as we define abstract 
"vector spaces" or, as it is sometimes called, "linear spaces". In particular, the conclu- 
sions (i) through (viii) of Theorem 1.1, page 3, become axioms [Ai]-[A 4 ], [Afi]-[M 4 ] below. 
We will see that, in a certain sense, we get nothing new. In fact, we prove in Chapter 5 
that every vector space over R which has "finite dimension" (denned there) can be identified 
with R" for some n. 

The definition of a vector space involves an arbitrary field (see Appendix B) whose 
elements are called scalars. We adopt the following notation (unless otherwise stated or 
implied): 

K the field of scalars, 
a, b,cork the elements of K, 

V the given vector space, 
u, v, w the elements of V. 

We remark that nothing essential is lost if the reader assumes that K is the real field R 
or the complex field C. 

Lastly, we mention that the "dot product", and related notions such as orthogonality, 
is not considered as part of the fundamental vector space structure, but as an additional 
structure which may or may not be introduced. Such spaces shall be investigated in the 
latter part of the text. 

Definition: Let A' be a given field and let 7 be a nonempty set with rules of addition and 
scalar multiplication which assigns to any u, v G V a sum u + v G V and to 
any u G V, k G K a product ku G V. Then V is called a vector space over K 
(and the elements of V are called vectors) if the following axioms hold: 

[Ai] : For any vectors u, v, w G V, (u + v) + w = u + (v + w). 

[A 2 ]: There is a vector in V, denoted by 0 and called the zero vector, for which u + 0 = u 
for any vector u GV. 

[As]: For each vector u GV there is a vector in V, denoted by —u, for which u + (—u) = 0. 

[A 4 ]: For any vectors u, v G V, u + v = v + u. 

[Mi\: For any scalar k G K and any vectors u, v G V, k(u + v) = ku + kv. 

[M 2 ] : For any scalars a, b G K and any vector uGV, (a + b)u = au + bu. 

[Ms]: For any scalars a,b G K and any vector u GV, (ab)u = a(bu). 

[M4]: For the unit scalar 1 G K, lu = u for any vector u GV. 
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The above axioms naturally split into two sets. The first four are only concerned with 
the additive structure of V and can be summarized by saying that V is a commutative group 
(see Appendix B) under addition. It follows that any sum of vectors of the form 

V1+V2+ ■ ■ ■ + Vm 

requires no parenthesis and does not depend upon the order of the summands, the zero 
vector 0 is unique, the negative —u of u is unique, and the cancellation law holds: 

u + w = v + w implies u = v 

for any vectors u,v,w G V. Also, subtraction is defined by 

u — V ' u + (—v) 

On the other hand, the remaining four axioms are concerned with the "action" of the 
field K on V. Observe that the labelling of the axioms reflects this splitting. Using these 
additional axioms we prove (Problem 4.1) the following simple properties of a vector space. 

Theorem 4.1: Let V be a vector space over a field K. 

(i) For any scalar JcGK and 0 G V, k0 = 0. 

(ii) For 0 G K and any vector u&V, Ou = 0. 

(iii) If ku = 0, where k G K and u G V, then k = 0 or u = 0. 

(iv) For any scalar k G K and any vector uEV, {-k)u = k(-u) = -ku. 



EXAMPLES OF VECTOR SPACES 

We now list a number of important examples of vector spaces. The first example is a 
generalization of the space R". 

Example 4.1: Let K be an arbitrary field. The set of all n-tuples of elements of K with vector 
addition and scalar multiplication defined by 

(a lt a 2 , a n ) + (6 t , b z , b n ) = + b lt a 2 +b 2 a„ + b„) 

and <*2 <0 = ( ka i> fe °2' • • • ' ka n) 

where a u fy, k e K, is a vector space over K; we denote this space by X". The zero 
vector in tf« is the w-tuple of zeros, 0 = (0, 0, ... , 0). The proof that K« is a vector 
space is identical to the proof of Theorem 1.1, which we may now regard as stating 
that B" with the operations defined there is a vector space over R. 



Example 4.2: Let V be the set of all m X n matrices with entries from an arbitrary field K. Then 
V is a vector space over K with respect to the operations of matrix addition and 
scalar multiplication, by Theorem 3.1. 



Example 4.3: Let V be the set of all polynomials a 0 + a t t + o 2 t 2 + • ■ + a n t» with coefficients ct { 
from a field K. Then V is a vector space over K with respect to the usual operations 
of addition of polynomials and multiplication by a constant. 

Example 4.4: Let K be an arbitrary field and let X be any nonempty set. Consider the set V of all 
functions from X into K. The sum of any two functions f,g eV is the function 
/ + g G V defined by 

+ *)<«) = /<*> + *(*) 
and the product of a scalar keK and a function / € V is the function kfeV 
defined by , „ , 

(kmx) = km 
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Then V with the above operations is a vector space over K (Problem 4.5). The zero 
vector in V is the zero function 0 which maps each x G X into 0 S K: 0{x) = 0 
for every x&X. Furthermore, for any function / £ V, —f is that function in V 
for which (—/)(«) = —/(*), for every x G X. 



Example 45: Suppose £ is a field which contains a subfield K. Then E can be considered to be a 
vector space over K, taking the usual addition in E to be the vector addition and 
denning the scalar product kv of k&K and v S E to be the product of k and v 
as element of the field E. Thus the complex field C is a vector space over the real 
field E, and the real field R is a vector space over the rational field Q. 



SUBSPACES 

Let W be a subset of a vector space over a field K. W is called a subspace of V if W is 
itself a vector space over K with respect to the operations of vector addition and scalar 
multiplication on V. Simple criteria for identifying subspaces follow. 

Theorem 4.2: W is a subspace of V if and only if 

(i) W is nonempty, 

(ii) W is closed under vector addition: v, w G W implies v + w G W, 

(iii) W is closed under scalar multiplication: v G W implies kv G W for 
every k G K. 

Corollary 4.3: W is a subspace of V if and only if (i) 0 € W (or W ¥• 0), and (ii) v, w G W 
implies av + bw G W for every a, b G K. 

Example 4.6: Let V be any vector space. Then the set {0} consisting of the zero vector alone, and 
also the entire space V are subspaces of V. 



Example 4.7 : (i) Let V be the vector space R 3 . Then the set W consisting of those vectors whose 
third component is zero, W — {(a, b, 0) : a, b G R}, is a subspace of V. 

(ii) Let V be the space of all square n X n matrices (see Example 4.2). Then the 
set W consisting of those matrices A = (o y ) for which o y = a n , called 
symmetric matrices, is a subspace of V. 

(iii) Let V be the space of polynomials (see Example 4.3). Then the set W consisting 
of polynomials with degree — n, for a fixed n, is a subspace of V. 

(iv) Let V be the space of all functions from a nonempty set X into the real field R. 
Then the set W consisting of all bounded functions in V is a subspace of V. 
(A function / € V is bounded if there exists MSB such that |/(a;)| - M for 
every x € X.) 

Example 4.8: Consider any homogeneous system of linear equations in n unknowns with, say, real 
coefficients: 

a U x l + «12 a; 2 + ■ • • + o, ln x n = 0 
a 2l x l <*22*2 + • • • + a 2n x n — 0 



+ a mi x 2 + • • • + a mn x n - 0 

Recall that any particular solution of the system may be viewed as a point in R". 
The set W of all solutions of the homogeneous system is a subspace of R™ (Problem 
4.16) called the solution space. We comment that the solution set of a nonhomo- 
geneous system of linear equations in n unknowns is not a subspace of R™. 
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Example 4.9: Let U and W be subspaees of a vector space V. We show that the intersection 
VnW is also a subspace of V. Clearly 0 G U and 0 S W since U and W are sub- 
spaces; whence O&UnW. Now suppose u,v G C/nW. Then u,v &U and v S TV 
and, since ?7 and W are subspaees, 

au + bv G U and au + bv e W 

for any scalars a,b€K. Accordingly, au + bv e UnW and so [7nTF is a sub- 
space of V. 



The result in the preceding example generalizes as follows. 



Theorem 4.4: The intersection of any number of subspaees of a vector space V is a 
subspace of V. 



LINEAR COMBINATIONS, LINEAR SPANS 

Let V be a vector space over a field K and let v u . . . , v m e V. Any vector in V of the 
form 

aiVi + CL2V2 + • • • + a m v m 

where the au e K, is called a linear combination of vi, v m . The following theorem 
applies. 

Theorem 4.5: Let S be a nonempty subset of V. The set of all linear combinations of 
vectors in S, denoted by L(S), is a subspace of V containing S. Further- 
more, if W is any other subspace of V containing S, then L(S) cW. 

In other words, L(S) is the smallest subspace of V containing S; hence it is called the 
subspace spanned or generated by S. For convenience, we define = {0}. 



Example 4.10: Let V be the vector space R 3 . The linear span of any nonzero vector u consists 
of all scalar multiples of u; geometrically, it is the line through the origin and the 
point m. The linear space of any two vectors u and v which are not multiples of 
each other is the plane through the origin and the points u and v. 




Example 4.11: The vectors e t = (1,0,0), e 2 = (0,1,0) and e 3 = (0,0,1) generate the vector space 
R3. For any vector (a, 6, e) GB 3 is a linear combination of the e ( ; specifically, 

(a, b, e) = a(l, 0, 0) + 6(0, 1, 0) + c(0, 0, 1) 
= ae 1 + be 2 + ce 3 



Example 4.12: The polynomials 1, t, t 2 , t 3 , • . . generate the vector space V of all polynomials 
(in*): V = L(l, t,t 2 , . . .). For any polynomial is a linear combination of 1 and 
powers of t. 
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Example 4.13: Determine whether or not the vector v = (3, 9, -4, -2) is a linear combination of 
the vectors u t = (1, -2, 0, 3), w 2 = (2, 3, 0, -1) and u 3 = (2, -1, 2, 1), i.e. belongs 
to the space spanned by the iq. 

Set v as a linear combination of the u { using unknowns y and z; that is, set 

v = xu x + yu 2 + zu s : 

(3, 9, -4, -2) = a;(l, -2, 0, 3) 4 2/(2, 3, 0, -1) + z(2, -1, 2, 1) 
= (x + 2y + 2z, -2x + 3y-z, 2z, 3x-y + z) 

Form the equivalent system of equations by setting corresponding components equal 
to each other, and then reduce to echelon form: 

x + 2y + 2z = 3 x + 2y + 2z = 3 x + 2y + 2z - 3 

-2x + Sy- z = 9 7y + 3z = 15 1y + 3z - 15 

or or 
2z - -4 2z - -4 2z = -4 

3* - y + z = -2 -7j/ - 5z = -11 -2z = 4 



x + 2y + 2z = 3 
or 7y + Sz - 15 

2z = -4 

Note that the above system is consistent and so has a solution; hence v is a linear 
combination of the u { . Solving for the unknowns we obtain x = 1, y = 3, z — —2. 
Thus v — u t + 3m 2 — 2u 3 . 

Note that if the system of linear equations were not consistent, i.e. had no solu- 
tion, then the vector v would not be a linear combination of the tt { . 



ROW SPACE OF A MATRIX 

Let A be an arbitrary mxn matrix over a field K: 





fan On 


Ol„ 




0,21 0/22 






\Oml Om2 


Omn 



The rows of A, 

Ri = (an, a 2 i, . . . , am), . . . , R m = (ami, Om2, . . . , dw) 

viewed as vectors in K n , span a subspace of K n called the row space of A. That is, 

row space of A = L(R U R 2 , . . . , R m ) 

Analogously, the columns of A, viewed as vectors in K m , span a subspace of K m called the 

column space of A. 

Now suppose we apply an elementary row operation on A, 

(i) Ri Rj, (ii) Ri -* kRi, k¥>0, or (iii) R t -> UR, + Ri 

and obtain a matrix B. Then each row of B is clearly a row of A or a linear combination of 
rows of A. Hence the row space of B is contained in the row space of A. On the other 
hand, we can apply the inverse elementary row operation on B and obtain A; hence the row 
space of A is contained in the row space of B. Accordingly, A and B have the same row 
space. This leads us to the following theorem. 

Theorem 4.6: Row equivalent matrices have the same row space. 

We shall prove (Problem 4.31), in particular, the following fundamental result con- 
cerning row reduced echelon matrices. 
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Theorem 4.7: Row reduced echelon matrices have the same row space if and only if they 
have the same nonzero rows. 

Thus every matrix is row equivalent to a unique row reduced echelon matrix called its 
row canonical form. 

We apply the above results in the next example. 

Example 4.14: Show that the space U generated by the vectors 

m, = (1, 2, -1, 3), u 2 = (2, 4, 1, -2), and u s = (3, 6, 3, -7) 
and the space V generated by the vectors 

v, = (1, 2, -4, 11) and v 2 = (2, 4, -5, 14) 

are equal; that is, U = V. 

Method 1. Show that each u { is a linear combination of v t and v 2 , and show that 
each v t is a linear combination of u v u 2 and w 3 . Observe that we have to show that 
six systems of linear equations are consistent. 

Method 2. Form the matrix A whose rows are the w i( and row reduce A to row 
canonical form: 

2 -1 3\ /l 2 -1 S\ /l 2 -1 

A = I 2 4 1 -2 to 0 0 3 -8 to 0 0 3 -8 




6 3-7/ \0 0 6 -16/ \0 0 0 0 

to ! o o 

0 

Now form the matrix B whose rows are v, and v 2 , and row reduce B to row canonical 
form: 

/l 2 -4 11 \ (\ 2 -4 11\ /l 2 0 1/8 \ 

B = ( 2 4-5 Uj *° U 0 3 - 8 j t0 U 0 1 -S/Sj 

Since the nonzero rows of the reduced matrices are identical, the row spaces of A 
and B are equal and so U = V. 



SUMS AND DIRECT SUMS 

Let U and W be subspaces of a vector space 7. The sum of U and W, written U + W, 
consists of all sums u + w where u e U and w GW: 

JJ + W - {u + w:uGU,wGW} 

Note that 0 = 0 + 0 G U + W, since 0 G tf, 0 G W. Furthermore, suppose u + w and 
u' + w' belong to U + W, with u,u' &U and w,w' G W. Then 

(u + w) + (u' + w') = (u + u') + (w + w') G U + W 

and, for any scalar k, k(u + w) = ku + kwGU + W 

Thus we have proven the following theorem. 

Theorem 4.8: The sum U + W of the subspaces U and W of V is also a subspace of V. 

Example 4.15: Let V be the vector space of 2 by 2 matrices over R. Let U consist of those 
matrices in V whose second row is zero, and let W consist of those matrices in V 
whose second column is zero: 
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Now U and W are subspaces of V. We have: 

u+w = {(: o) : °' 6 '* eK } - d = {(: o°) : aeR } 

That is, U + W consists of those matrices whose lower right entry is 0, and UnW 
consists of those matrices whose second row and second column are zero. 

Definition: The vector space V is said to be the direct sum of its subspaces U and W, 
denoted by 

V = U e W 

if every vector v G V can be written in one and only one way as v = u + w 
where u G U and w G W. 

The following theorem applies. 

Theorem 4.9: The vector space V is the direct sum of its subspaces U and W if and only 
if: (i) V = U+ W, and (ii) UnW = {0}. 

Example 4.16: In the vector space R 3 , let U be the xy plane and let W be the yz plane: 

U = {(a, b, 0) : a,be R} and W = {(0, b,c) : b,c& R} 

Then R 3 = U + W since every vector in R 3 is the sum of a vector in U and a vector 
in W. However, R 3 is not the direct sum of U and W since such sums are not 
unique; for example, 

(3, 5, 7) = (3, 1, 0) + (0, 4, 7) and also (3, 5, 7) = (3, -4, 0) + (0, 9, 7) 

Example 4,17: In R 3 , let U be the xy plane and let W be the z axis: 

U = {(a, 6,0): o,6€R} and W = {(0, 0, c) : c G R} 

Now any vector (a, 6, e) G R s can be written as the sum of a vector in U and a 
vector in V in one and only one way: 

(a, 6, c) = (a, b, 0) + (0, 0, c) 

Accordingly, R 3 is the direct sum of U and W, that is, R 3 = U ® W. 



Solved Problems 

VECTOR SPACES 

4.1. Prove Theorem 4.1: Let V be a vector space over a field K. 

(i) For any scalar JcGK and 0£7, kO = 0. 

(ii) For 0 G K and any vector itGF, (ht = 0. 

(iii) If ku — 0, where k G K and u£7, then = 0 or u = 0. 

(iv) For any k G K and any iteF, = fc(-it) = - ku. 

(i) By axiom [A 2 ] with m = 0, we have 0 + 0 = 0. Hence by axiom [MJ, fcO = fc(0 + 0) = 
A;0 + fcO. Adding — fcO to both sides gives the desired result. 

(ii) By a property of K, 0 + 0 = 0. Hence by axiom [M 2 ], Ow = (0 + 0)w = Qu + Ou. Adding - 0w 
to both sides yields the required result. 
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(iii) Suppose ku = 0 and k 0. Then there exists a scalar A;" 1 such that k l k = 1; hence 

w = lu = (fc- x fc)tt = fc-^fett) = fc^O = 0 

(iv) Using m + {-u) = 0, we obtain 0 = fcO = k(u + (-m)) = ku + k(-u). Adding -ku to both 
sides gives —ku = ft(— tt). 

Using fc + (— *) = 0, we obtain 0 = Ou = (k + (— fe))w = few + (-fc)u. Adding -ku to both 
sides yields -ku = (-fe)u. Thus (— k)u = k(—u) - -ku. 

4.2. Show that for any scalar k and any vectors u and v, k(u — v) - ku — kv. 

Using the definition of subtraction («-« = «+ (-• i>)) and the result of Theorem 4.1(iv) 
W-v) = -kv), 

k(u - v) = k(u + (-v)) = ku + k(-v) = ku + (-kv) = ku - kv 

4.3. In the statement of axiom [M 2 ], (a + b)u = au + bu, which operation does each plus 
sign represent? 

The + in (a + b)u denotes the addition of the two scalars a and 6; hence it represents the addi- 
tion operation in the field K. On the other hand, the + in au + bu denotes the addition of the two 
vectors au and bu; hence it represents the operation of vector addition. Thus each + represents a 
different operation. 

4.4. In the statement of axiom [Ms], (ab)u = a{bu), which operation does each product 
represent? 

In (ab)u the product ab of the scalars o and 6 denotes multiplication in the field K, whereas the 
product of the scalar ab and the vector u denotes scalar multiplication. 

In a(bu) the product 6m of the scalar 6 and the vector u denotes scalar multiplication; also, the 
product of the scalar a and the vector bu denotes scalar multiplication. 

4.5. Let V be the set of all functions from a nonempty set X into a field K. For any func- 
tions f,g<ZV and any scalar k G K, let f + g and kf be the functions in V defined 
as follows: 

(/ + £)(*) = f{x) + g{x) and (kf){x) = kf(x), Vx G X 
(The symbol V means "for every".) Prove that V is a vector space over K. 

Since X is nonempty, V is also nonempty. We now need to show that all the axioms of a vector 
space hold. 

[A J: Let f.g, h e V. To show that (/ + g) + h = / + (g + h), it is necessary to show that 
the function (/ + g) + h and the function / + (g + h) both assign the same value to each 
x e X. Now, 

((f + g) + h)(x) = (f + g)(x) + h(x) = (/(»)+ g(x)) + h(x), Va; G X 
(f+(g + h))(x) = fix) + (g + h)(x) = f(x) + (g(x) + h(x)), Vj£Z 
But fix), g(x) and h(x) are scalars in the field K where addition of scalars is associative; hence 

(/(») + g(x)) + h(x) = fix) + (g(x) + h(x)) 

Accordingly, (/ + g) + h = / + (g + h). 

[A 2 ]: Let 0 denote the zero function: 0(«) = 0,Vx£ X. Then for any function / G V, 
if + 0)(«) = /(*) + 0(«) = fix) + 0 = fix), Va; G X 
Thus f + 9 = f, and 0 is the zero vector in V. 
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[A 3 ]: For any function / G V, let -/ be the function denned by (-/)(*) = - fix). Then, 
(/ + (-/))(*) = f(x) + (-/)(*) = f(x) - /(*) = 0 = 0(x), Mx&X 
Hence / + (-/) = 0. 

[A 4 ]: Let f.gGV. Then 

(/ + g)(x) = f{x) + gix) = gix) + fix) = ig + /)(»), Vz S X 

Hence / + g = g + f. (Note that fix) + gix) = gix) + fix) follows from the fact that fix) and 
gix) are scalars in the field K where addition is commutative.) 

[AfJ: Let f.geV and k G K. Then 

W + </))(*) = *((/ + »)(*)) = kifix) + gix)) = kfix) + kgix) 
= (fc/)(x) + ikg)ix) = ikf + kg)ix), VxeX 

Hence kif + g) = kf + kg. (Note that &(/(*) + gix)) = kfix) + kgix) follows from the fact that 
k, fix) and gix) are scalars in the field K where multiplication is distributive over addition.) 

[ilfj: Let /ey and a.beK. Then 

((a +&)/)(») = ia+b)fix) = afix) + bfix) = iaf)ix) + bfix) 
= iaf+bf)ix), VxGX 

Hence (a + b)f = af + bf. 

[Af s ]: Let /6V and a,b&K. Then, 

iiab)f)ix) = (a6)/(a;) = a(&/(*)) = o(6/)(a;) = (a(&/))(z), Vi£X 
Hence (a6)/ = a(6/). 

[Afj: Let / € V. Then, for the unit \GK, Hf)ix) = l/(») = fix), Vx e X. Hence 1/ = /. 
Since all the axioms are satisfied, V is a vector space over X. 

4.6. Let V be the set of ordered pairs of real numbers: V = {(a, b) : a, b G R}. Show 
that V is not a vector space over R with respect to each of the following operations 
of addition in V and scalar multiplication on V: 

(i) (a, 6) + (c, d) = (a + c, b + d) and k{a, b) = {ka, 6); 

(ii) (a, 6) + (c, d) = (a, b) and fe(a, b) = (ka, kb); 

(iii) (a, b) + (c, d) = (a + e, b + d) and k(a, b) = (fc 2 a, k 2 b). 

In each case show that one of the axioms of a vector space does not hold. 

(i) Let r-X, 8 = 2, v = (3, 4). Then 

ir + s)v = 3(3,4) = (9,4) 
rv + sv = 1(3, 4) + 2(3, 4) = (3, 4) + (6, 4) = (9, 8) 
Since (r + s)v # rv + ev, axiom [M 2 ] does not hold. 

(ii) Let v = (1,2), to = (3,4). Then 

v + w = (1, 2) + (3, 4) 

w + v = (3, 4) + (1,2) 
Since v + w ^ w + v, axiom [A 4 ] does not hold. 

(iii) Let r = 1, s = 2, « = (3, 4). Then 

ir + s)v = 3(3,4) = (27,36) 
rv + sv = 1(3, 4) + 2(3, 4) = (3, 4) + (12, 16) = (15, 20) 
Thus (r + s)v rv + sv, and so axiom [Af 2 ] does not hold. 



= (1.2) 
= (3,4) 
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SUBSPACES 

4.7. Prove Theorem 4.2: Wis a subspace of V if and only if (i) W is nonempty, (ii) v,wGW 
implies v + wGW, and (iii) v G W implies kv G W for every scalar kGK. 

Suppose W satisfies (i), (ii) and (iii). By (i), W is nonempty; and by (ii) and (iii), the operations 
of vector addition and scalar multiplication are well defined for W. Moreover, the axioms [A J, [A 4 ], 
[Mi], [M 2 ], [M s ] and [Af 4 ] hold in W since the vectors in W belong to V. Hence we need only show 
that [A 2 ] and [A„] also hold in W. By (i), W is nonempty, say uGW. Then by (iii), Ou - 0 G W 
and v + 0 = v for every v G W. Hence satisfies [A 2 ]. Lastly, if v G W then (-l)v = -« G W 
and v + (-«) = 0; hence W satisfies [A 3 ]. Thus W is a subspace of V. 

Conversely, if W is a subspace of V then clearly (i), (ii) and (iii) hold. 



4.8. Prove Corollary 4.3: W is a subspace of V if and only if (i) 0 E W and (ii) v,wGW 
implies av + bw GW for all scalars a, b G K. 

Suppose W satisfies (i) and (ii). Then, by (i), W is nonempty. Furthermore, if v,w G W then, 
by (ii), v + w = lv + lwGW; and if « G W and fcGX then, by (ii), kv = kv + 0v G W. Thus 
by Theorem 4.2, is a subspace of V. 

Conversely, if W is a subspace of V then clearly (i) and (ii) hold in W. 



4.9. Let V = R 3 . Show that W is a subspace of 7 where: 

(i) w = {{a, b,0): a,b G R}, i.e. W is the *2/ plane consisting of those vectors whose 
third component is 0; 

(ii) W = {(a,b,c): a + b + c = 0}, i.e. W consists of those vectors each with the 
property that the sum of its components is zero. 

(i) 0 = (0, 0, 0) € W since the third component of 0 is 0. For any vectors v = (a, b, 0), w = 
(c, d, 0) in W, and any scalars (real numbers) k and k', 

kv + k'w = k(a, b, 0) + k'(e, d, 0) 

= (ka, kb, 0) + (k'c, k'd, 0) = (ka + k'c, kb + k'd, 0) 

Thus kv + k'w G W, and so W is a subspace of V. 

(ii) 0 = (0, 0,0)6W since 0 + 0 + 0 = 0. Suppose v = (o, 6,c), w = (a', 6', e') belong to TP, i.e. 
a + ft + c = 0 and a' + 6' + C = 0. Then for any scalars k and fc', 

fcu + fc'w = fc(a, &, c) + k'(a', b', c') 

= (ka, kb, kc) + (fcW, k'b', k'c') 
= (ka + k'a', kb + fc'6', fee + k'c') 

and furthermore, 

(fco + fc'o') + (kb + k'b 1 ) + (fco + fc'C) = k(a+b + c) + k'(a' + b' + c') 

= fcO + fc'O = 0 

Thus kv + k'w G W, and so W is a subspace of V. 

4.10. Let V = R 3 . Show that W is not a subspace of 7 where: 

0) _ {( ttj b> C ) : a ^ 0}, i.e. W consists of those vectors whose first component is 
nonnegative; 

(ii) W = {(a, b,c): tf + W + c*^ 1>, i.e. W consists of those vectors whose length does 

Hot 6XCG©d 1 " 

(iii) W = {(a, b, c) : a,b,cG Q>, i.e. W consists of those vectors whose components are 
rational numbers. 

In each case, show that one of the properties of, say, Theorem 4.2 does not hold, 
(i) „ = (1,2,3) GW and k — —5 G R. But kv = -5(1, 2, 3) = (-5, -10, -15) does not belong to 
W since -5 is negative. Hence W is not a subspace of V. 
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(ii) v = (1, 0, 0) S W and w = (0, 1, 0) £ W. But v + w = (1, 0, 0) + (0, 1, 0) = (1, 1, 0) does not 
belong to W since l 2 + l 2 + 0 2 = 2 > 1. Hence W is not a subspace of V. 

(iii) v = (1,2,3) GW and k = y/2GK. But kv = y/2 (1,2,3) = (\/2, 2V2, 3^2) does not belong to 
TP since its components are not rational numbers. Hence W is not a subspace of V. 



4.11. Let V be the vector space of all square n x n matrices over a field K. Show that W 
is a subspace of V where: 

(i) W consists of the symmetric matrices, i.e. all matrices A = (a tj ) for which 
fflji = cm; 

(ii) W consists of all matrices which commute with a given matrix T; that is, 
W = {AGV: AT = TA}. 

(i) 0 £ W since all entries of 0 are 0 and hence equal. Now suppose A = (a i} ) and B = (by) 
belong to W, i.e. a Si = a tj and 6 j{ = 6 tJ . For any scalars a t b E.K, aA + bB is the matrix 
whose ii-entry is aa i} + 66 y . But aa }i + bb j{ = aa tj + bb {j . Thus aA + 6£ is also symmetric, 
and so IF is a subspace of V. 

(ii) OeW since 0T = 0 = TO. Now suppose A,BGW; that is, AT — TA and BT = TB. For 
any scalars a,b &K, 

(aA + bB)T = (aA)T + (bB)T = a(AT) + b(BT) = a(TA) + b(TB) 

= T(aA) + T(bB) = T(aA + bB) 

Thus aA + bB commutes with T, i.e. belongs to W; hence W is a subspace of V. 



4.12. Let V be the vector space of all 2 x 2 matrices over the real field R. Show that W 
is not a subspace of V where: 

(i) W consists of all matrices with zero determinant; 

(ii) W consists of all matrices A for which A 2 = A. 

(i) (Recall that det^" ad - bo.) The matrices A = ^ and B = ^ belong 

to W since det(A) = 0 and det(B) = 0. But A + B = f Q ^ does not belong to W since 
det (A+B) = 1. Hence W is not a subspace of V. ^ 

(ii) The unit matrix 1 - belongs to W since 
But 2/ = ^ g g y does not belong to W since 

«■ ■ C I) - C 2) - 21 

Hence W is not a subspace of V. 



4.13. Let V be the vector space of all functions from the real field R into R. Show that W 
is a subspace of V where: 

(i) w = {/ : /(3) = 0}, i.e. W consists of those functions which map 3 into 0; 

(ii) W = {f: /(7) = /(l)}, i.e. W consists of those functions which assign the same 
value to 7 and 1; 

(iii) W consists of the odd functions, i.e. those functions / for which /(-*) = - /(*)• 
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Here 0 denotes the zero function: 0(x) = 0, for every x S R. 

(i) 0 6 W since 0/3) = 0. Suppose f.g&W, i.e. /(3) = 0 and g(Z) = 0. Then for any real 

numbers a and b, , ,,„ x 

(o/ + 6«r)(3) = a/(3) + b£f(3) = a0 + 60 = 0 

Hence af + bg e W, and so W is a subspace of V. 

(ii) OeW since 0(7) = 0 = 0(1). Suppose f.g&W, i.e. /(7) = /(l) and «r(7) = flr(l). Then, for 
any real numbers a and b, 

(a/ + 6tf)(7) = o/(7) + 6flr(7) = o/(l) + bg(l) = (af + bg)(l) 
Hence af + bgG W, and so W is a subspace of V. 

(iii) OeW since 0(-») = 0 = -0 = -0(*). Suppose f,g&W, i.e. f(-x) - - f(x) and g(-x) = 
— g(x). Then for any real numbers a and b, 

(af+bg)(-x) = af(-x) + bg(-x) = -af(x)-bg(x) = - (af(x) + bg(x)) = - (af + bg)(x) 
Hence af + bg € W, and so W is a subspace of V. 

4.14. Let V be the vector space of all functions from the real field R into R. Show that W 
is not a subspace of V where: 

(i) W={f: /(7) = 24-/(1)}; 

(ii) W consists of all nonnegative functions, i.e. all function / for which f(x) ^ 0, 
V*GR. 

(i) Suppose f.geW, i.e. /(7) = 2 + /(l) and g(7) = 2 + </(l). Then 

tf + 9)(D = fiD + = 2 + f(l) + 2 + fir(l) 

= 4 + /(l) + fir(l) = 4 + (f + g)(l) 2 + (f + g)(l) 
Hence / + g G W, and so W is not a subspace of V. 

(ii) Let k = -2 and let /£F be defined by /(*) = * 2 . Then f <= W since /(«) = » 2 - 0, 
V* € E. But (fc/)(5) = fc/(5) = (-2)(5 2 ) = -50 < 0. Hence kf « W, and so W is not a sub- 
space of V. 

4.15. Let V be the vector space of polynomials Oo + ait + a 2 t 2 + ■■■ + a n t n with real coef- 
ficients, i.e. Oi € R. Determine whether or not W is a subspace of V where: 

(i) W consists of all polynomials with integral coefficients; 

(ii) W consists of all polynomials with degree ^ 3; 

(iii) W consists of all polynomials b 0 + bit 2 + bit* + ■ ■ ■ + b n t 2n , i.e. polynomials with 
only even powers of t. 

(i) No, since scalar multiples of vectors in W do not always belong to W. For example, v = 
3 + 5t + 7t 2 G w but iv - f + ft + |« 2 € W. (Observe that W is "closed" under vector 
addition, i.e. sums of elements in W belong to W.) 

(ii) and (iii). Yes. For, in each case, W is nonempty, the sum of elements in W belong to W, and 
the scalar multiples of any element in W belong to W. 

4.16. Consider a homogeneous system of linear equations in n unknowns x u . . . , x n over a 

auXi + 0,12X2 + • • • + «iA = 0 

aziXl + 022X2 + • ■ ■ + d 2 nXn = 0 



OnnSl + a m 2X2 + • ■ • + OmnXn = 0 

Show that the solution set W is a subspace of the vector space K n . 
0 = (0, 0, . . . , 0) S W since, clearly, 

a u 0 + a i2 0+ •■■ + a ( „0 = 0, for t = 1, . . .,m 
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Suppose u = («!, m 2 , . . m„) and u = (v,, » 2 , . . ., v n ) belong to W, i.e. for i — 1, . . ., m 

o u w, + o j2 M 2 + • ■ • + a in M n = 0 

a n v i + a i2 v 2 + • • • + a,i„v n = 0 
Let a and 6 be scalars in K . Then 

au + 6« = (<Mt x + 6v 1 , ait 2 + bv 2 , . . . , au n + bv n ) 

and, for * = 1, ...,m, 

a u (aU] + ft^) + a i2 (att 2 + ftt^ + • • ■ + a in (att n + 6v n ) 

= a(a 41 % + + 1- a in u^ + b(a a vi + a i2 v 2 + • • • + a in v„) 

= aO + 60 = 0 

Hence au + 6v is a solution of the system, i.e. belongs to W. Accordingly, If is a subspace of K n . 
LINEAR COMBINATIONS 

4.17. Write the vector v = (1, -2, 5) as a linear combination of the vectors ei = (1, 1, 1), 
e 2 = (1, 2, 3) and e 3 = (2, -1, 1). 

We wish to express v as v = xe t + ye 2 + ze 3 , with x, y and z as yet unknown scalars. Thus 
we require 

(1, -2, 5) = x(l, 1, 1) + 1/(1, 2, 3) + z(2, -1, 1) 
= (*, x, x) + (y, 2y, Sy) + (2z, -z, z) 
= (x + y + 2z, x + 2y — z, x + Sy + z) 

Form the equivalent system of equations by setting corresponding components equal to each other, 
and then reduce to echelon form: 

* + y + 2z = 1 x + y + 2z - 1 * + y + 2z = 1 

x + 2y - z = -2 or y — 3z = -3 or y - 3z = -3 

x + Sy + z = 5 2y- z = 4 5z=10 

Note that the above system is consistent and so has a solution. Solve for the unknowns to obtain 
x - —6, y = 3, * = 2. Hence v = — 6ej + 3e 2 + 2e 3 . 

4.18. Write the vector v = (2, -5, 3) in R 3 as a linear combination of the vectors d = 
(1,-3,2), e 2 = (2,-4,-1) and e 3 = (l,-5,7). 

Set v as a linear combination of the e 4 using the unknowns x, y and z: v - xe Y + ye 2 + ze 3 . 

(2, -5, 3) = «(1, -3, 2) + y{2, -4, -1) + z(l, -5, 7) 

= (x + 2y + z, -3x - 4j/ - 5z, 2x - j/ + 7z) 

Form the equivalent system of equations and reduce to echelon form: 

x + 2y+z=2 x + 2y+ z = 2 K + 2y+z = 2 

-3* — 4y — 5z = -5 or 2j/ - 2z = 1 or 2# - 2z = 1 

2x - y + 7z = 3 -5y + 5z = -1 0 = 3 

The system is inconsistent and so has no solution. Accordingly, v cannot be written as a linear com- 
bination of the vectors e lt e 2 and e 3 . 

4.19. For which value of k will the vector u = (1, -2, k) in R 3 be a linear combination of 
the vectors v = (3, 0, -2) and w = (2, -1, -5) ? 

Set u = xv + yw: 

(1, -2, fe) = »(3, 0, -2) + »(2, -1, -5) = (3a + 2y, -y, -2x - 5*/) 

Form the equivalent system of equations: 

3x + 2y = 1, -y - -2, -2x - 5y = k 
By the first two equations, x = -1, y = 2. Substitute into the last equation to obtain k = -8. 
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4.20. Write the polynomial v = t 2 + At — 3 over R as a linear combination of the poly- 
nomials ei = t 2 - 2t + 5, e 2 = 2t 2 - St and e a = t + 3. 

Set d as a linear combination of the e t using the unknowns x, y and z: v = xe x + j/e 2 + ze 3 . 
«2 + 4t - 3 = x(t 2 -2t + 5) + y(2t 2 -3t) + z(t + 3) 

= xt* - 2xt + 5x + 2yt 2 - 3yt + zt + 3z 
= (x + 2j/)t 2 + (-2* - 3j/ + z)t + (5a; + 3z) 
Set coefficients of the same powers of t equal to each other, and reduce the system to echelon form: 
x + Zy =1 x + 2y =1 x + 2y = 1 

—2x — Sy + z = 4 or y + z = 6 or 2/ + z= 6 

5a; + 3z = -3 -lOy + 3z = -8 13z = 52 

Note that the system is consistent and so has a solution. Solve for the unknowns to obtain 
x = -3, y = 2, z = 4. Thus v = -3^ + 2c 2 + 4e 3 . 



4.21. Write the matrix E = Yj asa l mear combination of the matrices A = 

Set £ as a linear combination of A,B,C using the unknowns x,y,z: E = xA + yB + zC. 

(i -!) = •(! o) + *(J J) + •(! -J) 

fx x\ /0 0\ /0 2z\ _ / x a; + 2z\ 
\x o) \V v) \0 -z) \x + y y-zj 

Form the equivalent system of equations by setting corresponding entries equal to each other: 

a; = 3, x + y = 1, x + 2z = 1, y — z = — 1 

Substitute « = 3 in the second and third equations to obtain y = — 2 and z = —1. Since these 
values also satisfy the last equation, they form a solution of the system. Hence E = ZA — 2B — C. 



4.22. Suppose u is a linear combination of the vectors Vi, . . . , v m and suppose each Vi is a 
linear combination of the vectors toi, .. ., w,: 

u = aiVi + a 2 V2 + • • • + CLmVm and Vi = bnWi + b i2 w 2 + • • • + b in w n 
Show that u is also a linear combination of the wi. Thus if ScL{T), then L(S)<zL(T). 

u = a^Vi + a 2 v 2 + • • • + a m v m 

= OiibuWi + • • • + b ln w n ) + a 2 (b 21 w 1 + • • • + b 2n w n ) + • • • + a m (b ml Wx + • • • + b mn w n ) 
= (a 1 6 11 + a 2 b 21 + • • • + a m 6 mi )wi + • • • + (ai& ln + a 2 b 2n + • • • + a m b mn )w n 



or simply 




LINEAR SPANS, GENERATORS 

4.23. Show that the vectors u = (1, 2, 3), v = (0, 1, 2) and w = (0, 0, 1) generate R 3 . 

We need to show that an arbitrary vector (o, b, c) S R 3 is a linear combination of u, v and w. 
Set (a, 6, c) = xu + yv + zw: 

(a, b, c) = x(l, 2, 3) + 3/(0, 1, 2) + z(0, 0, 1) = (x, 2x + y,3x + 2y + z) 
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Then form the system of equations 

x = a z + 2y + 3x = c 

2x + y = 6 or y + 2x — b 

3x + 2y + z = e x = a 

The above system is in echelon form and is consistent; in fact x — a, y — b — 2a, z = c — 26 + a 
is a solution. Thus tt, v and w generate R 3 . 



4.24. Find conditions on a, b and c so that (a, b, e) € R 3 belongs to the space generated by 
u = (2, 1, 0), v = (1, -1, 2) and w = (0, 3, -4). 

Set (a, 6, c) as a linear combination of u, v and 10 using unknowns x, y and z: (a, 6, c) = 

aw + yv + zw. 

(a, 6, e) = a;(2, 1, 0) + y(l, -1, 2) + z(0, 3, -4) = (2x + y, x - y + Sz, 2y - 4z) 

Form the equivalent system of linear equations and reduce it to echelon form: 

2x + y =a 2x + y -a 2x + y =a 

x - y + 3z = 6 or Zy - 6z = a - 26 or 3y - 6z = a - 26 

2j/ - 4z = c 2j/ - 4z = c 0 = 2a — 46 - 3c 

The vector (a, 6, c) belongs to the space generated by u, v and w if and only if the above system is 
consistent, and it is consistent if and only if 2a - 46 - 3c = 0. Note, in particular, that u, v and 
w do not generate the whole space R 3 . 

4.25. Show that the xy plane W = {(a, b, 0)} in R 3 is generated by u and v where: (i) u = 
(1,2,0) and v = (0,1,0); (ii) u = (2, -1, 0) and v = (1,3,0). 

In each case show that an arbitrary vector (a, b, 0) e W is a linear combination of u and v. 

(i) Set (a, 6, 0) = xu + yv: 

(a, 6, 0) = x{l, 2, 0) + 3/(0, 1, 0) = (x, 2x + y, 0) 

Then form the system of equations 

x = a y + 2x = 6 

2x + y = 6 or » = a 

0 = 0 

The system is consistent; in fact x - a, y = b - 2a is a solution. Hence u and v generate W. 

(ii) Set (o, 6, 0) = xu + yv: 

(a, b, 0) = x(2, -1, 0) + y(l, 3, 0) = (2x + y,-x + Sy, 0) 
Form the following system and reduce it to echelon form: 

2x + y = a 2x + y = a 

—x + 3y = b or ly — a + 26 

0 = 0 

The system is consistent and so has a solution. Hence W is generated by u and v. (Observe 
that we do not need to solve for x and y; it is only necessary to know that a solution exists.) 

4.26. Show that the vector space V of polynomials over any field K cannot be generated by 
a finite number of vectors. 

Any finite set S of polynomials contains one of maximum degree, say m. Then the linear span 
L(S) of S cannot contain polynomials of degree greater than m. Accordingly, V ^ L(S), for any 
finite set S. 
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4.27. Prove Theorem 4.5: Let S be a nonempty subset of V. Then L(S), the set of all 
linear combinations of vectors in S, is a subspace of V containing S. Furthermore, if 
W is any other subspace of V containing S, then L(S) C W. 

If v G S, then Iv = v G L(S); hence S is a subset of L(S). Also, L(S) is nonempty since S is 
nonempty. Now suppose v, w G L(S); say, 

v — a 1 v l + • ■ ■ + a m v m and w — b^w^ + • • ■ + b n w n 

where v it WjG S and a it bj are scalars. Then 

v + w = a 1 v 1 + • • • + a m v m + 6 x Wj + • • • + b n w n 

and, for any scalar k, 

kv = fc(oii) 1 + • • • + a m v m ) = feajDj + • • • + ka m v m 

belong to L(S) since each is a linear combination of vectors in S. Accordingly, L(S) is a subspace 
of V. 

Now suppose W is a subspace of V containing S and suppose v 1: . . .,v m G S cW. Then all 

multiples a 1 v 1 <im v m. e W, where a f e and hence the sum OjVj + • • • + a m v m G IV. That 

is, W contains all linear combinations of elements of S. Consequently, L(S) c W as claimed. 



ROW SPACE OF A MATRIX 

4.28. Determine whether the following matrices have the same row space: 



A = 



1 5 
3 13 



B 



1 -1 -2 

3 -2 -3 





/I -1 




c = 


4 -3 






\3 -1 


3 



Row reduce each matrix to row canonical form: 
A = 



B = 



C = 




) * g ; 

) * c. 1 1) * c ; i 



to 




to 




to 




Since the nonzero rows of the reduced form of A and of the reduced form of C are the same, 
A and C have the same row space. On the other hand, the nonzero rows of the reduced form of B 
are not the same as the others, and so B has a different row space. 



4.29. Consider an arbitrary matrix A = (an). Suppose u = (6i, ...,&«) is a linear com- 
bination of the rows Ri, . . . , R m of A; say u = kiRi + ■ • • + kmR m . Show that, for each 
i, bi — fcittii + feo2i + • • • + kmdmi where an, . . . , ami are the entries of the ith column 
of A. 

We are given u = k 1 R 1 + • • • + k m R m ; hence 

(bi b n ) = fti(o Ul ...,Oi B ) + ••• + k m (a ml , ...,a mn ) 

= (Ma + • • • + fc m o m i, . • • , fei« m i + • ■ • + k m a mn ) 
Setting corresponding components equal to each other, we obtain the desired result. 
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4.30. Prove: Let A = (ay) be an echelon matrix with distinguished entries Oij x , a 2 j 2 , . . ., a r j r , 
and let B — (&y) be an echelon matrix with distinguished entries b lfcl , b 2 k 2 , ■ . ., b S k s : 



I 



A = 



a,ij 1 ****** 

02j * * * * 





drj r * * 























/ 



B = 



I 



*******\ 
b 2 k 2 ***** 



Suppose A and B have the same row space. Then the distinguished entries of A and 
of B are in the same position: ji — k u jn = fes, . . . , j r = k r , and r = s. 

Clearly A = 0 if and only if B = 0, and so we need only prove the theorem when r — 1 
and 8 — 1. We first show that j 1 — k v Suppose i x < k v Then the j t th column of B is zero. 
Since the first row of A is in the row space of B, we have by the preceding problem, Oj^ = 
c x 0 + c 2 0 + • • • + c m 0 = 0 for scalars e { . But this contradicts the fact that the distinguished 
element a 1Sl ¥> 0. Hence j t — & lf and similarly fc x — j v Thus j t = k y 

Now let A' be the submatrix of A obtained by deleting the first row of A, and let B' be the 
submatrix of B obtained by deleting the first row of B. We prove that A' and B' have the same 
row space. The theorem will then follow by induction since A' and B' are also echelon matrices. 

Let R = (o,,^, . . .,a n ) be any row of A' and let R u ...,R m be the rows of B. Since R is in 
the row space of B, there exist scalars d lt . . . , d m such that R = + d 2 R 2 + ■ ■ ■ + d m R m . Since 
A is in echelon form and R is not the first row of A, the ijth entry of R is zero: o { = 0 for 
i = j t = fcj. Furthermore, since B is in echelon form, all the entries in the k^tb. column of B are 0 



except the first: 6i fc # 0, but 



' J mk. 



= 0. Thus 



0 = a kl = d,6 lfcl + d 2 0 + • • • + d m 0 = dj 



Now 6 lfcl ¥• 0 and so d x = 0. Thus R is a linear combination of R 2 , ...,R m and so is in the row 
space of B'. Since R was any row of A', the row space of A' is contained in the row space of B'. 
Similarly, the row space of B' is contained in the row space of A'. Thus A' and B' have the same 
row space, and so the theorem is proved. 



4.31. Prove Theorem 4.7: Let A = (ay) and B = (&«) be row reduced echelon matrices. 
Then A and B have the same row space if and only if they have the same nonzero rows. 

Obviously, if A and B have the same nonzero rows then they have the same row space. Thus 
we only have to prove the converse. 

Suppose A and B have the same row space, and suppose R 0 is the ith row of A. Then there 
exist scalars c v . . . , c s such that 

R = CjBi + c 2 J? 2 + • • • + c s R s (1) 

where the Ri are the nonzero rows of B. The theorem is proved if we show that R — R it or 
Cj = 1 but c k = 0 for k i. 

Let a ij{ be the distinguished entry in R, i.e. the first nonzero entry of R. By (1) and Problem 4.29, 

««( = C 1 6 U, + C 2 6 2jj + • • • + C s 6 sj . (2) 

But by the preceding problem b i} . is a distinguished entry of B and, since B is row reduced, it is 
the only nonzero entry in the i 4 th column of B. Thus from (g) we obtain a y . = CjOy.. However, 
oy { = 1 and 6y { = 1 since A and B are row reduced; hence c { = 1. 

Now suppose k¥= i, and 6 fcjfc is the distinguished entry in R k . By (1) and Problem 4.29, 

aa k = <>it>ij k + c 2 b 2jk + •••■+ c s b sjk (3) 
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Since B is row reduced, 6 k3fc is the only nonzero entry in the j k th column of B\ hence by (S), 
a U k — c kHj k - Furthermore, by the preceding problem a fcjfc is a distinguished entry of A and, since 
A is row reduced, a jjfc = 0. Thus c fc 6 fcifc = 0 and, since 6 kJfc = 1, c k = 0. Accordingly R = R t 
and the theorem is proved. 

4.32. Determine whether the following matrices have the same column space: 

ll 3 5\ / 1 2 3\ 

A = 1 4 3 , B = —2—3-4 
\l 1 9/ \ 7 12 17/ 

Observe that A and B have the same column space if and only if the transposes A f and B* have 
the same row space. Thus reduce A 1 and B* to row reduced echelon form: 




Since A f and B 1 have the same row space, A and B have the same column space. 

4.33. Let R be a row vector and B a matrix for which RB is denned. Show that RB is a 
linear combination of the rows of B. Furthermore, if A is a matrix for which AB is 
defined, show that the row space of AB is contained in the row space of B. 

Suppose R = K.Oa, ...,o m ) and B = (b ti ). ■ Let B lt ...,B m denote the rows of B and 
B 1 , . . . , B n its columns. Then 

RB = (B«Bi, fi-B^ fi-B") 

= (Ol&n + a 2*>21 + 1" «m&ml. «1&12 + «2&22 + ' ' ' + <»m&m2. • • • . + «2 6 2n 1" «,«U 

= a 1 (6 11 , 6 12 , . . ., + o 2 (6 21 , 6 2 2> • • •» ft 2n) + • • • + a m (& ml , b m2 , . . ., 6 mn ) 
= ajBi + a 2 B 2 + • • • + a m B m 

Thus i2B is a linear combination of the rows of B, as claimed. 

By Problem 3.27, the rows of AB are RjB where R { is the ith row of A. Hence by the above 
result each row of AB is in the row space of B. Thus the row space of AB is contained in the row 
space of B. 



SUMS AND DIRECT SUMS 

4.34. Let U and W be subspaces of a vector space V. Show that: 

(i) U and W are contained in U + W; 

(ii) U + W is the smallest subspace of V containing U and W, that is, U + W is the 
linear span of U and W: U + W = L(U, W). 

(i) Let it£ !/, By hypothesis W is a subspace of y and so 0£ff. Hence a = ti + 06 U+W. 
Accordingly, U is contained in U+W. Similarly, W is contained in U + W. 

(ii) Since U +W is & subspace of V (Theorem 4.8) containing both U and W, it must also contain 
the linear span of U and W: L(U, W) C U + W. 

On the other hand, if v G U + W then v — u + w = lu + Iw where it G U and w €.W\ 
hence v is a linear combination of elements in UuW and so belongs to L(U, W). Thus 
U + W C L(I7, TP). 

The two inclusion relations give us the required result. 
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4.35. Suppose U and W are subspaces of a vector space V, and that {u,} generates U and 
{Wj} generates W. Show that {im, w s }, i.e. {tk} U {w s }, generates U + W. 

Let v G U + W. Then v = u + w where it G U and w G W. Since {%} generates U, u is a 
linear combination of Mj's; and since {vij} generates W, w is a linear combination of w/s: 

m = ffljM^ + a 2 u iz + • • • + a n u in , ttj G K 
w = bjw^ + b 2 w j2 + • • • + b m w jm , bj G K 
Thus v = u + w = a^u^ + atfjt^ + • ■ ■ + a n u in + b^w^ + b 2 u>j 2 + • • • + b m w i m 

and so {u it Wj} generates U+W. 

4.36. Prove Theorem 4.9: The vector space V is the direct sum of its subspaces U and W 
if and only if (i) V = U + W and (ii) t/n W = {0}. 

Suppose V = U (& W. Then any v G V can be uniquely written in the form v = u + w 
where uGU and w G W. Thus, in particular, V = U+W. Now suppose v G UnW. Then: 

(1) v = v + 0 where v G U, 0 G W; and (2) v = 0 + v where 0 £ 17, v G W 
Since such a sum for v must be unique, v — 0. Accordingly, Z7n W = {0}. 

On the other hand, suppose V = U + W &nd UnW = {0}. Let d£V. Since V = 17 + W, 
there exist u G U and tu G W such that v = u + w. We need to show that such a sum is unique. 
Suppose also that v = u' + w' where u' G U and w' G W. Then 

u + w = u' + w' and so u — u' = w' — w 

But u-u' G U and w' - w G W; hence by UnW = {0>, 

u — u' — 0, w' — w = 0 and so it = u', w = w' 

Thus such a sum for o£V is unique and V = 17 © W. 

4.37. Let Z7 and W be the subspaces of R 3 defined by 

U = {(a, b, c) : a = b = c} and W = {(0, b, c)} 
(Note that W is the yz plane.) Show that R 3 = U © TP. 

Note first that UnW = {0}, for v = (a, 6, c) G UnW implies that 

a = b = c and a = 0 which implies a. = 0, i> = 0, c = 0 

i.e. v = (0, 0, 0). 

We also claim that R 3 = U + W. For if v = (a, b, c) G R3, then v = (a, a, a) + (0,b-a,c- a) 
where (a, a, a) G U and (0, b - a, c-a) G W. Both conditions, UnW = {0} and R 3 = *7 + W, 
imply R3 = E7 © W. 



4.38. Let V be the vector space of n-square matrices over a field R. Let U and W be the 
subspaces of symmetric and antisymmetric matrices, respectively. Show that 
V = U © W. (The matrix M is symmetric iff M = M*, and anti-symmetric iff 
M* = -M.) 

We first show that V = U+W. Let A be any arbitrary w-square matrix. Note that 

A = £(A+A') + 4(A-A') 

We claim that £(A + A*) G U and that £(A — A') G W. For 

(£(A+A«))« = $(A+A*)' = £(A* + A") = £(A+A») 
that is, £(A + A') is symmetric. Furthermore, 

(£(A — A'))' = £(A-A«)t - $(A*-A) = — ^(A — A') 
that is, J(A — A') is antisymmetric. 

We next show that E7nt7 = {0}. Suppose AfGl7nW. Then M = M* and M* = -M which 
implies M = -M or M = 0. Hence t/n»7 = {0}. Accordingly, V= 17 © TP. 
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Supplementary Problems 



VECTOR SPACES 

4.39. Let V be the set of infinite sequences (a u a z , . . .) in a field K with addition in V and scalar multi- 
plication on V defined by 

(a u a 2 , . . .) + (6 1( b 2 , . . .) = (a t + b u a 2 + 6 2 , . . .) 

k(a lt a 2 , . . .) = (ka lt ka 2 , . . . ) 
where a it b } , k G K. Show that V is a vector space over K. 

4.40. Let V be the set of ordered pairs (a, 6) of real numbers with addition in V and scalar multiplication 
on V defined bv 

y (a, 6) + (c, d) = (a + c,b + d) and k(a, b) = (ka, 0) 

Show that V satisfies all of the axioms of a vector space except [M 4 ] : lu = u. Hence [Af 4 ] is not a 
consequence of the other axioms. 

4.41. Let V be the set of ordered pairs (a, b) of real numbers. Show that V is not a vector space over R 
with addition in V and scalar multiplication on V defined by: 

(i) (a, b) + (c, d) = (a + d, b + c) and k(a, b) = (ka, kb); 

(ii) (a, 6) + (e, d) — (a + c, b + d) and k(a, b) = (a, 6); 

(iii) (a, 6) + (c, d) - (0, 0) and k(a, b) = {ka, kb); 

(iv) (a, 6) + (c, d) = (ae, bd) and k(a, b) = (ka, kb). 

4.42. Let V be the set of ordered pairs (z lt z 2 ) of complex numbers. Show that V is a vector space over the 
real field R with addition in V and scalar multiplication on V defined by 

(z lt z 2 ) + (w lt w 2 ) = (z t + w lt z 2 + w 2 ) and k(e u z 2 ) = (kz it kz 2 ) 
where z it z 2 ,Wi, w 2 S C and k S B. 

4.43. Let V be a vector space over K, and let F be a subfield of K. Show that V is also a vector space 
over F where vector addition with respect to F is the same as that with respect to K, and where 
scalar multiplication by an element k S F is the same as multiplication by k as an element of K. 

4.44. Show that [A 4 ], page 63, can be derived from the other axioms of a vector space. 

4.45. Let U and W be vector spaces over a field K. Let V be the set of ordered pairs (u, w) where u 
belongs to U and w to W: V = {(u, w) : u e U, w G W}. Show that V is a vector space over K 
with addition in V and scalar multiplication on V defined by 

(u, w) + (it', w') = (u + u',w + w') and k(u, w) - (ku, kw) 

where u, u' GU, w,w' GW and kGK. (This space V is called the external direct sum of U 
and W.) 



SUBSPACES 

4.46. Consider the vector space V in Problem 4.39, of infinite sequences (a lt a 2 , . . .) in a field K. Show 
that W is a subspace of V if: 

(i) W consists of all sequences with 0 as the first component; 

(ii) W consists of all sequences with only a finite number of nonzero components. 

4.47. Determine whether or not Wis a subspace of R 3 if W consists of those vectors (a, b, c) G R 3 for 
which: (i) a = 26; (ii) a^b^c; (iii) ab = 0; (iv) a = 6 = c; (v) a - ft 2 ; (vi) fe x a + k 2 b + k 3 c = 0, 
where k { G R. 

4.48. Let V be the vector space of w-square matrices over a field K. Show that Wis a subspace of V if 
W consists of all matrices which are (i) antisymmetric (A* = -A), (ii) (upper) triangular, 

(iii) diagonal, (iv) scalar. 
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4.49. Let AX — B be a nonhomogeneous system of linear equations in n unknowns over a field K. 
Show that the solution set of the system is not a subspace of K n . 

4.50. Let V be the vector space of all functions from the real field R into R. Show that W is a subspace 
of V in each of the following cases. 

(i) W consists of all bounded functions. (Here / : R ~» R is bounded if there exists M G R such 
that |/(*)| — M, Vx G R.) 

(ii) W consists of all even functions. (Here /:R-* R is even if /( — x) — /(#)> Va; £ R.) 

(iii) W consists of all continuous functions. 

(iv) W consists of all differentiable functions. 

(v) W consists of all integrable functions in, say, the interval 0 — x — 1. 
(The last three cases require some knowledge of analysis.) 

4.51. Discuss whether or not R 2 is a subspace of R 3 . 

4.52. Prove Theorem 4.4: The intersection of any number of subspaces of a vector space V is a subspace 
of V. 

4.53. Suppose U and W are subspaces of V for which UuW is also a subspace. Show that either 
UcW or WcU. 

LINEAR COMBINATIONS 

4.54. Consider the vectors it = (1, —3, 2) and v = (2, —1, 1) in R 3 . 

(i) Write (1, 7, —4) as a linear combination of it and v. 

(ii) Write (2, —5, 4) as a linear combination of it and v. 

(iii) For which value of k is (1, fc, 5) a linear combination of it and vt 

(iv) Find a condition on a, 6 and c so that (a, 6, e) is a linear combination of u and v. 

4.55. Write u as a linear combination of the polynomials v = 2t 2 + Zt — 4 and w = t 2 - 2t — 3 where 
(i) it = 3* 2 + St - 5, (ii) u ~ 4t 2 — 6t — 1. 



LINEAR SPANS, GENERATORS 

4.57. Show that (1, 1, 1), (0, 1, 1) and (0, 1, -1) generate R 3 , i.e. that any vector (a, 6, c) is a linear com- 
bination of the given vectors. 

4.58. Show that the yz plane W = {(0, b, c)} in R« is generated by: (i) (0, 1, 1) and (0, 2, -1); (ii) (0, 1, 2), 
(0, 2, 3) and (0, 3, 1). 

4.59. Show that the complex numbers w = 2 + 3i and z = 1 - 2i generate the complex field C as a 
vector space over the real field R. 

4.60. Show that the polynomials (1 — t) 3 , (1 — t) 2 , 1 — t and 1 generate the space of polynomials of 
degree — 3. 

4.61. Find one vector in R 3 which generates the intersection of U and W where U is the xy plane: 
17 = {(a, b, 0)}, and W is the space generated by the vectors (1, 2, 3) and (1, -1, 1). 

4.62. Prove: L(S) is the intersection of all the subspaces of V containing S. 



4.56. 



Write £ as a linear combination of 




and 
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4.63. Show that L(S) = L(S U {0}). That is, by joining or deleting the zero vector from a set, we do not 
change the space generated by the set. 

4.64. Show that if S c T, then L(S) c L(T). 

4.65. Show that L(L(S)) = L(S). 

ROW SPACE OF A MATRIX 

4.66. Determine which of the following matrices have the same row space: 

* -(;"-:)■ "-(• 

4.67. Let Ml = (1,1,-1), w 2 = (2,3,-1), u 3 = (3,1,-5) 

v, - (1,-1,-3), v 2 = (3,-2,-8), v s = (2,1,-3) 
Show that the subspace of R 8 generated by the u { is the same as the subspace generated by the v t . 

4.68. Show that if any row of an echelon (row reduced echelon) matrix is deleted, then the resulting 
matrix is still in echelon (row reduced echelon) form. 

4.69. Prove the converse of Theorem 4.6: Matrices with the same row space (and the same size) are 
row equivalent. 

4.70. Show that A and B have the same column space iff A 1 and B t have the same row space. 

4.71. Let A and B be matrices for which AB is denned. Show that the column space of AB is contained 
in the column space of A. 

SUMS AND DIRECT SUMS 

4.72. We extend the notion of sum to arbitrary nonempty subsets (not necessarily subspaces) S and T of 
a vector space V by denning S + T = {s + t : s S S, t 6 T}. Show that this operation satisfies: 

(i) commutative law: S+T = T + S; 

(ii) associative law: (S t + S 2 ) + S s = S 1 + (S 2 + S 3 ) ; 

(iii) S + {0} = {0} + S = S; 

(iv) S + V = V + S = V. 

4.73. Show that for any subspace W of a vector space V, W + W = W. 

4.74. Give an example of a subset S of a vector space V which is not a subspace of V but for which 
(i) S + S - S, (ii) S + S C S (properly contained). 

4.75. We extend the notion of sum of subspaces to more than two summands as follows. If W lt W 2 , • ■ ■ , W n 
are subspaces of V, then 

Wi + W 2 + ••• + W n = {w 1 + w 2 + ■■■ +w n : Wj e W t } 

Show that: 

(i) L(W V W 2 W n ) = W, + W 2 + ■ ■ ■ + W n ; 

(ii) if Si generates W„ i = 1, . . . , «, then S t U S 2 U • • • U S n generates W x + W 2 + • • ■ + W n . 

4.76. Suppose U, V ami W are subspaces of a vector space. Prove that 

(UnV) + (UnW) c Un(V+W) 

Find subspaces of R 2 for which equality does not hold. 
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4.77. Let U, V and W be the following subspaces of R 3 : 

U = {(a,b,c): a+b + c = 0}, V = {(a,b,e): a = c], W = {(0,0,c): c£E) 
Show that (i) R3 = V + V, (ii) R3 = V + W, (iii) R» = V + W. When is the sum direct? 

4.78. Let V be the vector space of all functions from the real field R into R. Let U be the subspace of 
even functions and W the subspace of odd functions. Show that V = U © W. (Recall that / is 
even iff f(-x) = f(x), and / is odd iff f(-x) = -f(x).) 

4.79. Let W u W 2 , ... be subspaces of a vector space V for which W t c W 2 C • • • . Let W = U W 2 U • • • . 
Show that W is a subspace of V. 

4.80. In the preceding problem, suppose S t generates W t , i = 1,2, Show that S = S 1 U S 2 U ■ • • 

generates W. 

4.81. Let V be the vector space of w-square matrices over a field K. Let U be the subspace of upper 
triangular matrices and W the subspace of lower triangular matrices. Find (i) U + W , (ii) UnW. 

4.82. Let V be the external direct sum of the vector spaces U and W over a field K. (See Problem 4.45.) 
Let a. /v 

U = {(u,0): uGU}, W = {(0,w): w G W} 

Show that (i) U and W are subspaces of V, (ii) V = C7 © 



Answers to Supplementary Problems 

4.47. (i) Yes. (iv) Yes. 

(ii) No;e.g. (1,2,2) eW but -2(1,2,3) € W. (v) No; e.g. (9,3,0) €W but 2(9, 3, 0) <Z 

(iii) No;e.g. (1, 0, 0), (0, 1, 0) € W, (vi) Yes. 
but not their sum. 

4.50. (i) Let f,g&W with M f and M g bounds for / and g respectively. Then for any scalars a, b G R, 

\(af+bg)(x)\ = |a/(*) + 6tf(*)| -- \af(x)\ + \bg(x)\ = \a\ \f(x)\ + \b\ \g(x)\ ^ \a\M f + \b\M g 
That is, \a\M f + \b\M g is a bound for the function af+bg. 

(ii) (af + bg)(-x) = af(-x) + bg(-x) = af(x) + bg(x) = (af + bg)(x) 

4.51. No. Although one may "identify" the vector (a,b) G R2 with, say, (a,b,0) in the xy plane in R3, 
they are distinct elements belonging to distinct, disjoint sets. 

4.54. (i) -3it + 2v. (ii) Impossible, (iii) k = -8. (iv) a - 36 - 5c = 0. 

4.55. (i) u — 2v — w. (ii) Impossible. 

4.56. (i) E = 2A-B + 2C. (ii) Impossible. 
4.61. (2, -5, 0). 

4.66. A and C. 

4.67. Form the matrix A whose rows are the u K and the matrix B whose rows are the v it and then show 
that A and B have the same row canonical forms. 

4.74. (i) InR2,let S = {(0,0), (0,1), (0,2), (0,3), . . .}. 
(ii) InR2,Iet S = {(0,5), (0,6), (0,7), ...}. 

4.77. The sum is direct in (ii) and (iii). 

4.78. Hint. f(x) = $(f(x) + /(-*)) + %(f(x) - /(-*)), where £(/(*) + /(-»)) is even and £(/(«) - /(-*)) 
is odd. 

4.81. (i) V = U + W. (ii) Un W is the space of diagonal matrices. 



Chapter 5 



Basis and Dimension 

INTRODUCTION 

Some of the fundamental results proven in this chapter are: 

(i) The "dimension" of a vector space is well defined (Theorem 5.3). 

(ii) If V has dimension n over K, then V is "isomorphic" to K n (Theorem 5.12). 

(iii) A system of linear equations has a solution if and only if the coefficient and 
augmented matrices have the same "rank" (Theorem 5.10). 

These concepts and results are nontrivial and answer certain questions raised and investi- 
gated by mathematicians of yesterday. 

We will begin the chapter with the definition of linear dependence and independence. 
This concept plays an essential role in the theory of linear algebra and in mathematics in 
general. 



LINEAR DEPENDENCE 

Definition: Let V be a vector space over a field K. The vectors v u ...,v m GV are said 
to be linearly dependent over K, or simply dependent, if there exist scalars 
a u . . . , a m € K, not all of them 0, such that 

dlVi + (I2V2. + ■ • • + dmVm = 0 (*) 

Otherwise, the vectors are said to be linearly independent over K, or simply 
independent. 

Observe that the relation (*) will always hold if the a's are all 0. If this relation holds 
only in this case, that is, 

am + a 2 v 2 + ■ ■ • + OmVm — 0 only if Ci = 0, . . . , a™ = 0 
then the vectors are linearly independent. On the other hand, if the relation (*) also holds 
when one of the a's is not 0, then the vectors are linearly dependent. 

Observe that if 0 is one of the vectors Vi, ...,v m , say vi = 0, then the vectors must be 
dependent; for 

lvt + 0v 2 + ■•■ + 0v m = 1 • 0 + 0 + • • • + 0 = 0 

and the coefficient of vi is not 0. On the other hand, any nonzero vector v is, by itself, 

independent; for , A L A . , _ A 

kv — 0, v ¥° 0 implies k = 0 

Other examples of dependent and independent vectors follow. 

Example 5.1: The vectors « = (1,-1,0), v = (1,3,-1) and w = (5, 3, -2) are dependent since, 
for 3m + 2v - w = 0, 

3(1, -1, 0) + 2(1, 3, -1) - (5, 3, -2) = (0, 0, 0) 
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Example 5.2: We show that the vectors u = (6, 2, 3, 4), v = (0, 5, -3, 1) and w = (0, 0, 7, -2) 
are independent. For suppose xu + yv + zw = 0 where x, y and z are unknown 
scalars. Then 

(0, 0, 0, 0) = x(6, 2, 3, 4) + 3,(0, 5, -3, 1) + z(0, 0, 7, -2) 

= (6x, 2x + 5y, 3x-Sy + 7z, 4x + y - 2z) 

and so, by the equality of the corresponding components, 

6x = 0 

2* + 5y = 0 

3x - 3y + 7z = 0 

4x + y — 2z = 0 

The first equation yields x = 0; the second equation with x = 0 yields J/ = 0; and 
the third equation with x = 0, y — 0 yields z = 0. Thus 

xm + i/u + zw = 0 implies x = 0, y = 0, z = 0 

Accordingly it, u and to are independent. 

Observe that the vectors in the preceding example form a matrix in echelon form: 

/6 2 3 4\ 
0 5-3 1 
\0 0 7-2/ 

Thus we have shown that the (nonzero) rows of the above echelon matrix are independent. 
This result holds true in general; we state it formally as a theorem since it will be frequently- 
used. 

Theorem 5.1: The nonzero rows of a matrix in echelon form are linearly independent. 

For more than one vector, the concept of dependence can be defined equivalently as 
follows: 

The vectors vi, . . . , v m are linearly dependent if and only if one of them is a linear 
combination of the others. 

For suppose, say, Vi is a linear combination of the others: 

Vi = CliVi + • • ■ + di-iVi-i + Oi+lVi+I + • • • + dmVrn 

Then by adding — v t to both sides, we obtain 

aiVl + • • • + Oi-iVi-i - Vi + Oi + lVi+i + • • • + dmVm — 0 

where the coefficient of v { is not 0; hence the vectors are linearly dependent. Conversely, 
suppose the vectors are linearly dependent, say, 

biVi + • • • + bjVj + • • • + b m v m = 0 where b, ¥* 0 

Then Vj = — by 1 biV\ — ••• — bf 1 b i -iVj-i — &, rl &i+i v i+i — ••• — bf 1 b m v m 

and so Vj is a linear combination of the other vectors. 

We now make a slightly stronger statement than that above; this result has many im- 
portant consequences. 

Lemma 5.2: The nonzero vectors vi, . . ., v m are linearly dependent if and only if one of 
them, say v ir is a linear combination of the preceding vectors: 

Vi — kiVi + k 2 V2 + ■ • • + fci-iVi-i 
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Remark 1. The set {vi, . . . , v m } is called a dependent or independent set according as the 
vectors Vi, . . . , v m are dependent or independent. We also define the empty- 
set 0 to be independent. 

Remark 2. If two of the vectors v u . . . , v m are equal, say vi - v 2 , then the vectors are 
dependent, For Vl - V2 + 0v 3 + ■ ■ ■ + 0v m = 0 

and the coefficient of vi is not 0. 

Remark 3. Two vectors Vi and v 2 are dependent if and only if one of them is a multiple of 
the other. 

Remark 4. A set which contains a dependent subset is itself dependent. Hence any 
subset of an independent set is independent. 

Remark 5. If the set {v h .... v m } is independent, then any rearrangement of the vectors 
{v^, Vi 2 , . . . , Vi m } is also independent. 

Remark 6. In the real space R 3 , dependence of vectors can be described geometrically as 
follows: any two vectors u and v are dependent if and only if they lie on the 
same line through the origin; and any three vectors u, v and w are dependent 
if and only if they lie on the same plane through the origin: 





u and v are dependent. 



it, v and w are dependent. 



BASIS AND DIMENSION 

We begin with a definition. 
Definition: A vector space V is said to be of finite dimension n or to be n-dimensional, 
written dim V = n, if there exists linearly independent vectors e h e 2 , ...,e n 
which span V. The sequence {ei, e 2 , . . ., e„} is then called a basis of V. 
The above definition of dimension is well defined in view of the following theorem. 
Theorem 5.3: Let V be a finite dimensional vector space. Then every basis of V has the 
same number of elements. 

The vector space {0} is defined to have dimension 0. (In a certain sense this agrees with 
the above definition since, by definition, 0 is independent and generates {0}.) When a 
vector space is not of finite dimension, it is said to be of infinite dimension. 

Example 5.3: Let K be any field. Consider the vector space K n which consists of n-tuples of ele- 
ments of K. The vectors n 

e x = (1, 0, 0, . . ., 0, 0) 

e 2 = (0,1,0, ...,0,0) 



e„ = (0,0,0 0,1) 

form a basis, called the usual basis, of K n . Thus K n has dimension n. 
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Example 5.4: Let U be the vector space of all 2X3 matrices over a field K. Then the matrices 

/l 0 ON /0 1 ON /0 0 IN 
\0 0 0/' \0 0 0/' V,0 0 0/' 

/0 0 ON /0 0 ON /0 0 ON 

\i o o/' Vo i oy' 1,0 o iy 

form a basis of £/. Thus dim U — 6. More generally, let V be the vector space 
of all m X n matrices over K and let E tj £7 be the matrix with iy-entry 1 and 0 
elsewhere. Then the set {Sy} is a basis, called the usual basis, of V (Problem 5.32); 
consequently dim V — mn. 

Example 5.5 : Let W be the vector space of polynomials (in t) of degree — n. The set {1, t, t 2 , . . . , t n } 
is linearly independent and generates W. Thus it is a basis of W and so 
dim W = n + 1. 

We comment that the vector space V of all polynomials is not finite dimensional 
since (Problem 4.26) no finite set of polynomials generates V. 

The above fundamental theorem on dimension is a consequence of the following im- 
portant "replacement lemma": 

Lemma 5.4: Suppose the set {vi, v 2 , . . ., v n ) generates a vector space V. If {wi, . . ., w m } 
is linearly independent, then m — n and V is generated by a set of the form 

{Wi, . . ., W m , V h , . . ., Vi n _J 

Thus, in particular, any « + l or more vectors in V are linearly dependent. 

Observe in the above lemma that we have replaced m of the vectors in the generating 
set by the m independent vectors and still retained a generating set. 

Now suppose S is a subset of a vector space V. We call {vi, . . . , v m ) a maximal in- 
dependent subset of S if: 

(i) it is an independent subset of S; and 

(ii) {vi, . . ., v m , w) is dependent for any w € S. 

The following theorem applies. 

Theorem 5.5: Suppose S generates V and {v h . . ., v m ) is a maximal independent subset 
of S. Then {v ir . . ., v m } is a basis of V. 

The main relationship between the dimension of a vector space and its independent 
subsets is contained in the next theorem. 

Theorem 5.6: Let V be of finite dimension n. Then: 

(i) Any set of n + 1 or more vectors is linearly dependent. 

(ii) Any linearly independent set is part of a basis, i.e. can be extended to 
a basis. 

(iii) A linearly independent set with n elements is a basis. 



Example 5.6: The four vectors in K 4 

(1, 1, 1, 1), (0, 1, 1, 1), (0, 0, 1, 1), (0, 0, 0, 1) 

are linearly independent since they form a matrix in echelon form. Furthermore, 
since dim K* = 4, they form a basis of K*. 

Example 5.7: The four vectors in R 3 , 

(257,-132,58), (43,0,-17), (521,-317,94), (328,-512,-731) 
must be linearly dependent since they come from a vector space of dimension 3. 
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DIMENSION AND SUBSPACES 

The following theorems give basic relationships between the dimension of a vector space 
and the dimension of a subspace. 

Theorem 5.7: Let W be a subspace of an n-dimension vector space V. Then dim W^n. 
In particular if dim W = n, then W - V. 

Example 5.8: Let W be a- subspace of the real space R 3 . Now dim R 3 = 3; hence by the preced- 
ing theorem the dimension of W can only be 0, 1, 2 or 3. The following cases apply: 

(i) dim W = 0, then W = {0}, a point; 

(ii) dim W = 1, then W is a line through the origin; 

(iii) dim W = 2, then W is a plane through the origin; 

(iv) dim W = 3, then W is the entire space R 3 . 

Theorem 5.8: Let U and W be finite-dimensional subspaces of a vector space V. Then 
U + W has finite dimension and 

dim(U + V) = dim U + dim W - dim (UnW) 

Note that if V is the direct sum of U and W, i.e. V = U © W, then dim V = 
dim U + dim W (Problem 5.48). 

Example 5.9: Suppose U and W are the xy plane and yz plane, respectively, in R 3 : U - {(o, 6,0)}, 
W = {(0, b, e)}. Since R s = U + W, dim (U + W) = 3. Also, dim U = 2 and 
dim W -2. By the above theorem, 

3 = 2 + 2 - dim(l7nW0 or dim(I/nW0 = 1 
Observe that this agrees with the fact that UnW is the y axis, i.e. UnW = 
{(0, 6, 0)}, and so has dimension 1. 




RANK OF A MATRIX 

Let A be an arbitrary m x n matrix over a field K. Recall that the row space of A is 
the subspace of K n generated by its rows, and the column space of A is the subspace of K m 
generated by its columns. The dimensions of the row space and of the column space of A 
are called, respectively, the row rank and the column rank of A. 

Theorem 5.9: The row rank and the column rank of the matrix A are equal. 

Definition: The rank of the matrix A, written rank (A), is the common value of its row 
rank and column rank. 

Thus the rank of a matrix gives the maximum number of independent rows, and also 
the maximum number of independent columns. We can obtain the rank of a matrix as 
follows. 

ll 2 0 -1\ 

Suppose A = 2 6 -3 -3 . We reduce A to echelon form using the elementary 
row operations: \ 3 10-6 —5/ 
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A to 




to 



/I 


2 


0 


0 


2 


-3 


\o 


0 


0 



Recall that row equivalent matrices have the same row space. Thus the nonzero rows of the 
echelon matrix, which are independent by Theorem 5.1, form a basis of the row space of 
A. Hence that rank of A is 2. 



APPLICATIONS TO LINEAR EQUATIONS 

Consider a system of m linear equations in n unknowns X\, . . . , «» over a field K: 

auXi + anx 2 + • • • + ainX„ = bi 

Ol\X\ + OnXi + • ■ • + (tinXn = bi 



OmiXi + Om.-2.Xi + • • • + fflmnl, = 6m 

or the equivalent matrix equation 

AX — B 

where A = (ay) is the coefficient matrix, and X = and B = (b { ) are the column vectors 
consisting of the unknowns and of the constants, respectively. Recall that the augmented 
matrix of the system is defined to be the matrix 



(A, B) = 



fan 
an 



dl2 

Ct22 



Ctln 
ain 



62 



ami 



Remark 1. 



The above linear equations are said to be dependent or independent according 
as the corresponding vectors, i.e. the rows of the augmented matrix, are 
dependent or independent. 



Remark 2. Two systems of linear equations are equivalent if and only if the corresponding 
augmented matrices are row equivalent, i.e. have the same row space. 

Remark 3. We can always replace a system of equations by a system of independent 
equations, such as a system in echelon form. The number of independent 
equations will always be equal to the rank of the augmented matrix. 

Observe that the above system is also equivalent to the vector equation 



Xi 



On \ 




/ttl2 


a 2 i 


| + .( 


022 




Oml j 




\O m 2 



+ 



+ Xn 


lai„\ 

ain 


lb\ 




\Omn/ 


\bmj 



Thus the system AX = B has a solution if and only if the column vector £ is a linear 
combination of the columns of the matrix A, i.e. belongs to the column space of A. This 
gives us the following basic existence theorem. 



Theorem 5.10: The system of linear equations AX — B has a solution if and only if the 
coefficient matrix A and the augmented matrix (A, B) have the same rank. 
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Recall (Theorem 2.1) that if the system AX = B does have a solution, say v, then its 
general solution is of the form v + W = {v + w: w G W} where W is the general solution 
of the associated homogeneous system AX — 0. Now If is a subspace of K n and so has a 
dimension. The next theorem, whose proof is postponed until the next chapter (page 127), 
applies. 

Theorem 5.11: The dimension of the solution space W of the homogeneous system of linear 
equations AX = 0 is n — r where n is the number of unknowns and r is 
the rank of the coefficient matrix A. 

In case the system AX = 0 is in echelon form, then it has precisely n-r free variables 
(see page 21), say, x h ,x h , . . .,x in _ r . Let v } be the solution obtained by setting 8^ = 1, 
and all other free variables = 0. Then the solutions Vi, v n -r are linearly independent 
(Problem 5.43) and so form a basis for the solution space. 

Example 5.10: Find the dimension and a basis of the solution space W of the system of linear 
equations 

x + 2y - 4z + 3r - s = 0 
x + 2y - 2z + 2r + s = 0 
2x + 4y - 2z + 3r + 4s = 0 

Reduce the system to echelon form: 

x + 2y - 4* + 3r - s = 0 , , „ „_ n 

x + 2y — 4« + 3r — s = 0 

2z - r + 2s = 0 and then _ 

2z — r + 2s — 0 

6z - 3r + 6s = 0 

There are 5 unknowns and 2 (nonzero) equations in echelon form; hence dim W = 
5 — 2 = 3. Note that the free variables are y, r and s. Set: 

(i) y = 1, r = 0, 8 = 0, (ii) y = 0, r = 1, s = 0, (iii) j/ = 0, r = 0, 8 = 1 

to obtain the following respective solutions: 

Vl = (-2,1,0,0,0), v 2 = (-1,0,^,1,0), ^ = (-3,0,-1,0,1) 

The set {v v v 2 , v 3 } is a basis of the solution space W. 



COORDINATES 

Let {d, . . . , ej be a basis of an w-dimensional vector space V over a field and let v 
be any vector in V. Since {e f } generates V, v is a linear combination of the e,: 

d =. aid + a2«2 + • • • + a n e n , an^K 

Since the d are independent, such a representation is unique (Problem 5.7), i.e. the n 
scalars oi, . . ., a„ are completely determined by the vector v and the basis {ei}. We call 
these scalars the coordinates of v in {d}, and we call the «-tuple (oi, . . ., On) the coordinate 
vector of -v relative to {ei} and denote it by [v] e or simply [v]: 

[V]e — (Oi, a 2 , . . . , On) 

Example 5.11 : Let V be the vector space of polynomials with degree - 2: 

V = {o* 2 +6t + c: a,6,ceR} 

The polynomials 

6l = 1, e 2 = t-1 and e 3 = (t-1) 2 = i 2 - 2t + 1 
form a basis for V. Let v = 2t 2 -5t + 6. Find [v] e , the coordinate vector of v 
relative to the basis {ei, e 2 , e 3 }. 
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Set v as a linear combination of the e ; using the unknowns x, y and z: v = aej + 
ye 2 + ze 3 . 

2*2 - 5t + 6 = «(1) + y(t - 1) + z(f2 - 2t + 1) 
= a; + j/t - 3/ + z«2 _ 2zt + z 
= z*2 + - 2z)t + (a; - v + z) 
Then set the coefficients of the same powers of t equal to each other: 

x — y + z = 6 
2/ - 2z = -5 
z = 2 

The solution of the above system is x - S, y = -1, z = 2. Thus 
d = 3 ei - e 2 + 2e 3 , and so [«]„ = (3, -1, 2) 

Example 5.12: Consider the real space R 3 . Find the coordinate vector of v = (3, 1, -4) relative to 
the basis /, = (1, 1, 1), f 2 = (0, 1, 1), f 3 = (0, 0, 1). 

Set v as a linear combination of the /; using the unknowns x, y and z: v = xf t + 
Vfi. + «/ 3 - 

(3, 1, -4) = x(l, 1, 1) + y(Q, 1, 1) + Z (0, 0, 1) 
= (x, x, x) + (0, y, y) + (0, 0, z) 
= (x, x + y, x + y + z) 

Then set the corresponding components equal to each other to obtain the equivalent 
system of equations 

* =3 
x + y = 1 
x + y + z = —4 

having solution * = 3, y = -2, z- -5. Thus [v] f = (3, -2, -5). 

We remark that relative to the usual basis e x = (i ( n, 0), e 2 = (0, 1, 0), e 3 = 
(0, 0, 1), the coordinate vector of v is identical to v itself: [v] e = (3, 1, -4) = v . 

We have shown above that to each vector v € V there corresponds, relative to a given 
basis {ex, e n }, an n-tuple [v] e in K n . On the other hand, if (ai, . . . , On) G K", then there 
exists a vector in V of the form a x e x +•■• +a„e„. Thus the basis {ej} determines a one-to- 
one correspondence between the vectors in V and the n-tuples in K\ Observe also that if 

v = aiei + • • • + (ue n corresponds to (oi, . . . , a„) 
and w = + ■ • • + 6„e„ corresponds to (6i, . . . , 6„) 

then 

v + w = (ai + &i)e! + • • • + (o„ + 6„)e„ corresponds to (oi, . . . , a„) + (6i, . . . , b„) 
and, for any scalar k G K, 

kv = (kai)ei + ■ ■ ■ + (kou)e„ corresponds to k{a h . . . , a„) 
That is, [v + w] e = [v] e + [w] e and [kv], = k[v] e 

Thus the above one-to-one correspondence between V and K n preserves the vector space 
operations of vector addition and scalar multiplication; we then say that V and K n are 
isomorphic, written V a K\ We state this result formally. 

Theorem 5.12: Let V be an w-dimensional vector space over a field K. Then V and K n are 
isomorphic. 
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The next example gives a practical application of the above result. 

Example 5.13: Determine whether the following matrices are dependent or independent: 

*-G o1> * = G c ^(- »"") 

The coordinate vectors of the above matrices relative to the basis in Example 
5.4, page 89, are 

[A] = (1,2,-3,4,0,1), [B] = (1,3,-4,6,5,4), [C] = (3,8,-11,16,10,9) 

Form the matrix M whose rows are the above coordinate vectors: 

jl 2-3 4 0 l\ 

M = 1 3 -4 6 5 4 

\3 8 -11 16 10 9/ 

Row reduce M to echelon form: 

ll 2 -3 4 0 l\ /l 2 -3 4 0 l\ 

M to 0 1 —1 2 5 3 to 0 1-12 5 3 
\0 2 -2 4 10 6/ \0 0 0 0 0 0/ 

Since the echelon matrix has only two nonzero rows, the coordinate vectors [A], [B] 
and [C] generate a space of dimension 2 and so are dependent. Accordingly, the 
original matrices A , B and C are dependent. 



Solved Problems 

LINEAR DEPENDENCE 

5.1. Determine whether or not u and v are linearly dependent if: 

(i) u = (3,4), v = (1,-3) (iii) u = (4,3,-2), v = (2,-6,7) 

(ii) u = (2,-3), v = (6,-9) (iv) u = (-4,6,-2), v = (2,-3,1) 

' \3 0-1/ 0-2/ ^6 -5 i) [l 2-3 

(vii) u = 2 - 5t + 6t 2 - t 3 , v = 3 + 2t - At 2 + 5t s 

(viii) u = l-St + 2t 2 - St 3 , v = -3 + 9* - 6t 2 + 9t 3 

Two vectors u and v are dependent if and only if one is a multiple of the other. 

(i) No. (ii) Yes; for v = 3m. (iii) No. (iv) Yes; for u = —2v. (v) Yes; for v = 2u. (vi) No. 
(vii) No. (viii) Yes; for v = —3m. 

5.2. Determine whether or not the following vectors in R 3 are linearly dependent: 

(i) (1,-2,1), (2, 1,-1), (7, -4,1) (iii) (1,2,-3), (1,-3,2), (2,-1,5) 

(ii) (1, -3, 7), (2, 0, -6), (3, -1, -1), (2, 4, -5) (iv) (2, -3, 7), (0, 0, 0), (3, -1, -4) 

(i) Method 1. Set a linear combination of the vectors equal to the zero vector using unknown 
scalars x, y and z: 

*(1, -2, 1) + y{2, 1, -1) + z(7, -4, 1) = (0, 0, 0) 
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Then (*, ~2x, x) + (2y, y, -y) + (Iz, -4z, z) = (0, 0, 0) 

or (x + 2y + Iz, -2x + y — 4z, x — y + z) - (0, 0, 0) 

Set corresponding components equal to each other to obtain the equivalent homogeneous system, 
and reduce to echelon form: 

* + 2y + 7z = 0 x + 2y + 7z = 0 

„ x + 2y + Iz = 0 

-2a; + j/ - 4z = 0 or 5y + lOz = 0 or 

2/ + 2z = 0 

a - y + z = 0 -3j/ - 6z = 0 

The system, in echelon form, has only two nonzero equations in the three unknowns; hence the 
system has a nonzero solution. Thus the original vectors are linearly dependent. 

Method 2. Form the matrix whose rows are the given vectors, and reduce to echelon form using 
the elementary row operations: 






to 0 5-3 to 
\0 10 -6/ 

Since the echelon matrix has a zero row, the vectors are dependent. (The three given vectors 
generate a space of dimension 2.) 

(ii) Yes, since any four (or more) vectors in R 3 are dependent. 

(iii) Form the matrix whose rows are the given vectors, and row reduce the matrix to echelon form: 

to 

Since the echelon matrix has no zero rows, the vectors are independent. (The three given vectors 
generate a space of dimension 3.) 

(iv) Since 0 = (0, 0, 0) is one of the vectors, the vectors are dependent. 




2 






/I 


2 




5 


3 


to 


0 


-5 


1) 


5 






\o 


0 





5.3. Let V be the vector space of 2 x 2 matrices over R. Determine whether the matrices 
A,B,C GV are dependent where: 

«Hi i> Mo 1 :)■ c -d o 



3 lj' \2 2)' " [-4 0 

(i) Set a linear combination of the matrices A, B and C equal to the zero matrix using unknown 
scalars x, y and z; that is, set xA + yB + zC = 0. Thus: 



or 



■G i) + <i I) + <l I) - C :) 

c :) + « :) + c :) ■ c :) 

/x + y + z x + z\ _ /0 0\ 
V x x + y) ~ \0 Oj 
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Set corresponding entries equal to each other to obtain the equivalent homogeneous system of 
equations: 

x + y + z — 0 
x + z - 0 
x =0 
x + y =0 

Solving the above system we obtain only the zero solution, x = 0, y = 0, z = 0. We have 
shown that xA + yB + zC implies x = 0, y = 0, z - 0; hence the matrices A,B and C are 
linearly independent. 

Set a linear combination of the matrices A,B and C equal to the zero vector using unknown 
scalars x, y and z; that is, set xA + yB + zC = 0. Thus: 

■G i) + "G") + 'U~o) - G ») 

/ a; 2a;\ /Zy -y\ I z -5z\ /0 0\ 

/ x + 3j/ + z 2* - - 5z\ /0 0\ 



Set corresponding entries equal to each other 
linear equations and reduce to echelon form: 

* + 3j/ + z = 0 
2x - y — 5z = 0 
3* + 2j/ - 4z = 0 

* + 2y =0 



to obtain the equivalent homogeneous system of 

x + Zy + z = 0 

-7y - 7z = 0 

° r -ly - 7z = 0 

—J/ — z = 0 



or finally x + 3y + z = 0 

3/ + z = 0 

The system in echelon form has a free variable and hence a nonzero solution, for example, 
x = 2, y = -1, z = 1. We have shown that + 3/B + zC = 0 does not imply that x = 0, 
^ = o, z = 0; hence the matrices are linearly dependent. 



Let V be the vector space of polynomials of degree — 3 over E. Determine whether 
u, v, w G V are independent or dependent where: 

(i) u = i 3 - St 2 + 5t + 1, v = t s - t 2 + St + 2, w = 2*8 - 4t 2 + 9* + 5 

(ii) « = J 3 + 4t 2 - 2t + 3, v = t 3 + 6t 2 - t + 4, w = 34 s + 8t 2 - St + 7 

(i) Set a linear combination of the polynomials u,v and w equal to the zero polynomial using 
unknown scalars x, y and z; that is, set xu + yv + zw = 0. Thus: 

a;^ - 3t 2 + 5t + 1) + 3/(t»-t 2 + 8t + 2) + z(2t 3 - 4t 2 + 9t + 5) = 0 

or xt 3 - Sxt 2 + 5xt + x + y& - yt 2 + Syt + 2y + 2zt» - 4zt 2 + 9zt + 5z = 0 

or (x + y + 2z)t? + (-3* - y - 4z)t 2 + (5* + 8y + 9z)t + (x + 2j/ + 5z) = 0 

The coefficients of the powers of t must each be 0: 

x + y + 2z = 0 

— 3x — y — 4z = 0 

5x + Sy + 9z = 0 

* + 2y + 5z = 0 

Solving the above homogeneous system, we obtain only the zero solution: * = 0, y = 0, z = 0; 
hence u, v and w are independent. 
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(ii) Set a linear combination of the polynomials u, v and w equal to the zero polynomial using 
unknown scalars x, y and z; that is, set xu + yv + zw = 0. Thus: 

a .( t 3 + 4 j2_2t + 3) +j,(f8 + 6t*- t + 4) + z(3t* + 8t 2 -8t + 7) = 0 
or xt 3 + 4xt 2 - 2xt + 3x + y& + 6yt 2 - yt + 4y + 3zt* + 8zt 2 - 8zt + 7z = 0 

or (x + y + Zz)t* + (4x + 6y + »z)P+ (~2x-y-Sz)t + (3x + 4y + 72) = 0 

Set the coefficients of the powers of * each equal to 0 and reduce the system to echelon form: 

x + y + Sz = 0 x + y + 3z = 0 

4x + 6y + 8z = 0 2y - 4z = 0 

or 

-2x - y — 8z = 0 2/-2z = 0 

3* + 4j/ + 7z = 0 y - 2z = 0 

or finally x + y + &z _ 0 

2/ - 22 = 0 

The system in echelon form has a free variable and hence a nonzero solution. We have shown 
that xu + yv + zw = 0 does not imply that x = 0, j/ = 0, 2 = 0; hence the polynomials are 
linearly dependent. 

5.5. Let V be the vector space of functions from R into R. Show that f,g,hGV are 
independent where: (i) f(t) = e* g(t) = t\ h(t) = t; (ii) f(t) = sin<, g(t) = cost, 
h(t) = £. 

In each case set a linear combination of the functions equal to the zero function 0 using unknown 
scalars *, y and z: xf + yg + zh = 0; and then show that x = 0, y = 0, z = 0. We emphasize that 
xf + yg + zh = 0 means that, for every value of t, xf(t) + yg(t) + zh(t) = 0. 

(i) In the equation xe 2t + yt 2 + zt = 0, substitute 

t = 0 to obtain xe" + yO + zO = 0 or x = 0 

t = 1 to obtain xe 2 + y + z =0 

t = 2 to obtain xe* + 4y + 2z = 0 

x =0 

xe 2 + y + z = 0 to obtain only the zero solution: x = 0, y = 0, 2 = 0. 

xe* + 4y + 2z = 0 
Hence /, # and are independent. 



Solve the system ■ 



(ii) Method 1. In the equation x sin t + y cos t + zt = 0, substitute 

t = 0 to obtain x'0 + j/*l + 2- 0 = 0 or j/ = 0 

* = w/2 to obtain x • 1 + y • 0 + 2tt/2 = 0 or x + vz/2 = 0 

t = 7T to obtain x • 0 + 1) + z • tr = 0 or — j/ + ttz = 0 

1/ = 0 

x + vz/2 = 0 to obtain only the zero solution: x = 0, y = 0, z = 0. Hence 

"2/ + 7T2 = 0 

/, g and ft are independent. 



Solve the system 



Method 2. Take the first, second and third derivatives of x sin t + y cos t + zt = 0 with 
respect to t to get 

x cos t — y sin t + z = 0 (i) 

— x sin * — {/ cos t = 0 (g) 

— x cos t + 2/ sin * = 0 (5) 
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Add {1) and (8) to obtain z = 0. Multiply (2) by sin t and (3) by cos i, and then add: 
sin t X (2): — x sin 2 t — y sin t cos t = 0 
cos t X (3): —a; cos 2 1 + 3/ sin t cos t = 0 

-x(sin 2 1 + cos 2 1) =0 or x = 0 

Lastly, multiply (2) by — cos t and (5) by sin t; and then add to obtain 

2/(cos 2 1 + sin 2 t) = 0 or j/ = 0 
Since sc sin t + y cos * + zt = 0 implies a; = 0, i/ = 0, z = 0 

/, fir and h are independent. 

Let u, v and w be independent vectors. Show that u + v, u-v and u - 2v + w are 
also independent. 

Suppose x(u + v) + y(u - v) + z(u - 2v + w) = 0 where x, y and z are scalars. Then 
xu + xv + yu — yv + zu — 2zv + zw = 0 or 

(x + y + z)u + (x — y — 2z)v + zw = 0 
But u, v and w are linearly independent; hence the coefficients in the above relation are each 0: 

x + y + z = 0 
x - y - 2z = 0 

2 = 0 

The only solution to the above system is * = 0, y = 0, z = 0. Hence w + v, u - v and u - 2v + w are 
independent. 

Let vi, v 2 ,...,Vm be independent vectors, and suppose it is a linear combination of 
the Vi, say u = awi + (I2V2 + •■• + OmV m where the a ( are scalars. Show that the 
above representation of u is unique. 

Suppose u = Mi + 62^2 + • • • + *> m v m where the b t are scalars. Subtracting, 
0 = u - u = («! — b l )v l + (a 2 - b 2 )v 2 + • • • + (a m - 6 m )i> m 
But the v { are linearly independent; hence the coefficients in the above relation are each 0: 
a t - 6 t = 0, a 2 — 6 2 = 0, . . ., a m - b m = 0 

Hence a, = 6 lf a 2 = 6 2 , . . ., a m = b m and so the above representation of u as a linear combination 
of the v 4 is unique. 

Show that the vectors v = (1 +t, 2i) and w = (1, in C 2 are linearly dependent 
over the complex field C but are linearly independent over the real field R. 

Recall that 2 vectors are dependent iff one is a multiple of the other. Since the first coordinate 
of w is 1, v can be a multiple of w iff v = (1 + i)w. But 1 + i € R; hence v and w are independent 
overR. Since (l+ t > = (l + i)(l,l + i) = (1 + i, 2t) = i» 

and 1 + ieC, they are dependent over C. 

Suppose S = {vi, ...,Vm) contains a dependent subset, say {v u ...,v r }. Show that 
S is also dependent. Hence every subset of an independent set is independent. 
Since {v lt ...,v r ) is dependent, there exist scalars a u ...,a r , not all 0, such that 

o 1 d 1 + 02^2 + • ■ • + a r v r = 0 



CHAP. 5] 



BASIS AND DIMENSION 



99 



Hence there exist scalars a t a r , 0, . . .,0, not all 0, such that 

a l v l + • • ■ + a T v r + 0v r+1 + ■ ■ ■ + 0v m = 0 

Accordingly, S is dependent. 

5.10. Suppose {vi, . . .,Vm) is independent, but {v u . . .,v m , w) is dependent. Show that w 
is a linear combination of the Vu 

Method 1. Since {v v . . . , v m , w} is dependent, there exist scalars a u a m , b, not all 0, such that 
a 1 v 1 + • • • + a m v m + bw = 0. If 6 = 0, then one of the a ( is not zero and a t v t + • ■ ■ + a m v m = 0. 
But this contradicts the hypothesis that {«„ .. .,v m } is independent. Accordingly, 6#0 and so 

w = b~ l (-a 1 v 1 a m v m ) = -b-^a^ - • • • - 6" 1 o m v m 

That is, w is a linear combination of the v f . 

Method 2. If w = 0, then w = 0v t + • • • + 0v m . On the other hand, if then, by Lemma 

5.2, one of the vectors in {v t , . . . , v m , w] is a linear combination of the preceding vectors. This 
vector cannot be one of the v's since . . . , t> m } is independent. Hence w is a linear combination 
of the i>j. 

PROOFS OF THEOREMS 

5.11. Prove Lemma 5.2: The nonzero vectors V\, . . . , v m are linearly dependent if and only 
if one of them, say v it is a linear combination of the preceding vectors: Vi = 

diVi + • ■ ' + Oi-iVi-i. 

Suppose -Uj = a 1 v 1 + ■•• + o j _ l v l _ 1 . Then 

0l v, + • • • + a i - 1 v i - l — v { + 0v i+l + • • • + 0v m = 0 

and the coefficient of v { is not 0. Hence the v { are linearly dependent. 

Conversely, suppose the v { are linearly dependent. Then there exist scalars a„ .... a m , not all 
0, such that a^v^ + • • • + a m v m — 0. Let fc be the largest integer such that a k ^ 0. Then 

a i v i + ' • • + «kVfc + 0u k + 1 + • • • + 0v m — 0 or a^Vx + • ■ • + a k v k = 0 

Suppose k = 1; then = 0, a t ¥* 0 and so v t = 0. But the v ( are nonzero vectors; hence 
k > 1 and 

v k = -o-'o^! - • • • - a k l a k _ 1 v k _ 1 
That is, v k is a linear combination of the preceding vectors. 

5.12. Prove Theorem 5.1: The nonzero rows Ri, . . .,R n of a matrix in echelon form are 
linearly independent. 

Suppose {R n , R n -i, . . - ,Ri} is dependent. Then one of the rows, say R m , is a linear combination 
of the preceding rows: 

R m — "m + l^m + l + Om + 2^m + 2 + ' " ' + a rfin (*) 

Now suppose the fcth component of R m is its first nonzero entry. Then, since the matrix is in echelon 
form, the fcth components of R m +i, . . - ,R„ are all 0, and so the fcth component of (*) is o m+1 « 0 + 
°m+2* 0 + ' " " + a n' 0 = °- But tms contradicts the assumption that the fcth component of R m is 
not 0. Thus R lt . . . , R n are independent. 

5.13. Suppose {vi, . . ., v m } generates a vector space V. Prove: 

(i) If w GV, then {w, v\, . . . , v m } is linearly dependent and generates V. 

(ii) If Vi is a linear combination of the preceding vectors, then [vi, . . . , vi-u Vt+ 1, . . . , v m } 
generates V. 

(i) If w e V, then w is a linear combination of the v f since {v { } generates V. Accordingly, 
{w, v m } is linearly dependent. Clearly, w with the v { generate V since the v t by them- 
selves generate V. That is, {w, v lt . . ., v m } generates V. 
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(ii) Suppose v { = k^i + • • • + fc t _ 1 i; i _ 1 . Let uGV. Since {v { } generates V, u is a linear com- 
bination of the v u say, u - + • • ■ + a m v m . Substituting for v it we obtain 

u = a^i + • • • + eii-jUj-! + o^fex^! + • • • + fef— i v«— i) + a i+1 i> i+1 + • • • + a m v m 
= (o. t + a i k l )v l + •■• + (a i - 1 + a i k i _ 1 )v i _ 1 + a i + l v i + 1 + ■■■ + a m v m 

Thus {v u . . ., v t _ 1( v i+1 , . . .,v OT > generates V. In other words, we can delete i> { from the gen- 
erating set and still retain a generating set. 



5.14. Prove Lemma 5.4: Suppose {vi, . . .,v n } generates a vector space V. If {wi, . . .,w m } 
is linearly independent, then m ^ n and V is generated by a set of the form 
{wi, . . .,w m ,vi v . . .,Vi n _ m }. Thus, in particular, any + 1 or more vectors in V are 
linearly dependent. 

It suffices to prove the theorem in the case that the v t are all not 0. (Prove!) Since the {uj 
generates V, we have by the preceding problem that 

v lt v n } (1) 

is linearly dependent and also generates V. By Lemma 5.2, one of the vectors in (1) is a linear com- 
bination of the preceding vectors. This vector cannot be w u so it must be one of the v's, say v s . 
Thus by the preceding problem we can delete Vj from the generating set (1) and obtain the generating 

Set {«i, V lt . .., Vj- lt v j+1 , . . ., v n } (2) 

Now we repeat the argument with the vector w 2 . That is, since (2) generates V, the set 

{w u W 2 , V lt . . ., Vj- U v j+ i, ...,v n } (S) 

is linearly dependent and also generates V. Again by Lemma 5.2, one of the vectors in (3) is a linear 
combination of the preceding vectors. We emphasize that this vector cannot be or w 2 since 
{w u w m } is independent; hence it must be one of the v's, say v k . Thus by the preceding problem 
we can delete v k from the generating set (3) and obtain the generating set 

{W 1 ,W 2 ,V 1 , ...,Vj- U V j+u . . ., V k _ 1( Vfc + l, ...,v n ] 

We repeat the argument with w 3 and so forth. At each step we are able to add one of the 
w's and delete one of the v's in the generating set. If m ?z n, then we finally obtain a generating 
set of the required form: 

{w u ...,w m ,v h , ...,v in _J 

Lastly, we show that m > n is not possible. Otherwise, after n of the above steps, we obtain 
the generating set {w lt . . ., w„}. This implies that w n+1 is a linear combination of w u ...,w n which 
contradicts the hypothesis that {wj is linearly independent. 



5.15. Prove Theorem 5.3: Let V be a finite dimensional vector space. Then every basis of 
V has the same number of vectors. 

Suppose {e lt e 2 , . ..,«„} is a basis of V, and suppose {f lt f 2 , . . .} is another basis of V. Since 
{e,} generates V, the basis {/i,/ 2 , • • •} must contain n or less vectors, or else it is dependent by the 
preceding problem. On the other hand, if the basis {/x./a, . . .} contains less than n vectors, then 

{ ei e J is dependent by the preceding problem. Thus the basis {/i,/ 2 , •••} contains exactly n 

vectors, and so the theorem is true. 

5.16. Prove Theorem 5.5: Suppose {v u ...,v m } is a maximal independent subset of a set 
S which generates a vector space V. Then {v u ...,v m } is a basis of V. 

Suppose wSS. Then, since {vj} is a maximal independent subset of S, {v lt ..., v m , w) is 
linearly dependent. By Problem 5.10, w is a linear combination of the v u that is, w G L(i>j). Hence 
S C L(Vi). This leads to V = L(S) C L^) C V. Accordingly, {vj generates V and, since it is in- 
dependent, it is a basis of V. 
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^ 5.17. Suppose V is generated by a finite set S. Show that V is of finite dimension and, in 
particular, a subset of S is a basis of V. 

Method 1. Of all the independent subsets of S, and there is a finite number of them since S is finite, 
one of them is maximal. By the preceding problem this subset of S is a basis of V. 

Method 2. If S is independent, it is a basis of V. If S is dependent, one of the vectors is a linear 
combination of the preceding vectors. We may delete this vector and still retain a generating set. 
We continue this process until we obtain a subset which is independent and generates V, i.e. is a 
basis of V. 

* 5.18. Prove Theorem 5.6: Let V be of finite dimension n. Then: 

(i) Any set of n + 1 or more vectors is linearly dependent. 

(ii) Any linearly independent set is part of a basis. 

(iii) A linearly independent set with n elements is a basis. 

Suppose {e u . . . , e„} is a basis of V. 

(i) Since {e u . . . , c„} generates V, any n + 1 or more vectors is dependent by Lemma 5.4. 

(ii) Suppose {v lt . . . , v r ) is independent. By Lemma 5.4, V is generated by a set of the form 

S = {v u ...,v r , e (i , e in r } 

By the preceding problem, a subset of S is a basis. But S contains n elements and every basis 
of V contains n elements. Thus S is a basis of V and contains {v u . . . , v r } as a subset. 

(iii) By (ii), an independent set T with n elements is part of a basis. But every basis of V contains 
n elements. Thus, T is a basis. 



5.19. Prove Theorem 5.7: Let Wbea subspace of an w-dimensional vector space V. Then 
dim W — n. In particular, if dim W = n, then W = V. 

Since V is of dimension n, any n + lor more vectors are linearly dependent. Furthermore, since 
a basis of W consists of linearly independent vectors, it cannot contain more than n elements. 
Accordingly, dim W — n. 

In particular, if {w u w n } is a basis of W, then since it is an independent set with n elements 
it is also a basis of V. Thus W — V when dim W = n. 



5.20. Prove Theorem 5.8: dim(i7 + W) = dim U + dim W - dim(UnW). 

Observe that U nW is a subspace of both U and 17. Suppose dim U = m, dim W = n and 
dim (17 n W) = r. Suppose {v lt . . ., v r } is a basis of UnW. By Theorem 5.6(H), we can extend {dJ 
to a basis of U and to a basis of W; say, 

{^l v r ,u lt u m _ r } and {v^ ...,v„w lt w,.,} 

are bases of U and W respectively. Let 

B - {v v . . ., v r , u u . w x , . . ., w„_ r } 

Note that B has exactly m + n — r elements. Thus the theorem is proved if we can show that B is a 
basis of U + W. Since {v it Uj} generates 17 and {v { , iv k ) generates W, the union B = {v it Uj, w k } 
generates U+W. Thus it suffices to show that B is independent. 

Suppose 

ajV! + • • • + a r v r + bjii! + • ■ • + 6 m _ r tt m _ r + e t Wi + • • • + c n _ r w n _ r = 0 (1) 
where a i( b } , c k are scalars. Let 

v = a l v l + ■■■ +a r v r + 6 lM , + • • • + 6 m _ r w m _ r (2) 
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By (1), we also have that 



V = -Ciltf! - • • • - c n _ r w n _ r 



(3) 



Since {v^Uj} cU, v e U by (2); and since {w k }oW, v G W by (5). Accordingly, nei/nW. 
Now {v{i is a basis of UnW and so there exist scalars d u . . . , d r for which v = d^i + • • • + d r v r . 
Thus by (3) we have 

d^! + • • • + d r v r + + • - • + c„_ r w n _ r = 0 

But w k } is a basis of W and so is independent. Hence the above equation forces c x = 0, . . . , 
c„_ r = 0. Substituting this into (1), we obtain 

+ • • • + a r v r + &!% + • • • + 6 m _ r w m _ r = 0 

But {v lt Uj} is a basis of U and so is independent. Hence the above equation forces «i = 0, . . . , 
a r = 0, &! = 0, . . . , 6 m _ r = 0. 

Since the equation (1) implies that the a it b s and c k are all 0, B = {v u u jt w k } is independent 
and the theorem is proved. 



5.21. Prove Theorem 5.9: The row rank and the column rank of any matrix are equal. 
Let A be an arbitrary m X n matrix: 







o 12 . 


• "in 




«21 


a 22 


■ °2n 




\a m i 


«m2 • 





Let E v R 2 , ...,B m denote its rows: 

J?l = (tt n , 0 12 , . . . , Oi n ), •••> B m = (*ml> a m2> • • • > °m») 

Suppose the row rank is r and that the following r vectors form a basis for the row space: 

Si = (&n, &12. • • • , &ln)> $2 = (&21> &22 b 2n)> •••> S r = (&rl> & r2> • • • > 6 rn) 

Then each of the row vectors is a linear combination of the S { : 

JZj = knSi + ki 2 S 2 + ' ' ' + fclr^r 
B 2 = fc 21 S! + fc 2 2S 2 + ■ • ' + ^2r^r 



where the fc y are scalars. Setting the ith components of each of the above vector equations equal to 
each other, we obtain the following system of equations, each valid for i = 1, . . . , n: 

«» — + fc 12 6 2i + - • - + klr 6 ri 

a 2i = k 21 b u + fe 2 2&2i + • • • + k 2r b ri 



Thus for i — 1, . . . , w: 



«mi = fc rol&H + fc m2 6 2i + ' " ' + k mr b H 



= 6., 



M 




I** \ 






&21 


+ &2i 


«22 


















In other words, each of the columns of A is a linear combination of the r vectors 









1^12 


I 


K 2i 


■I 










\^m2 
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Thus the column space of the matrix A has dimension at most r, i.e. column rank — r. Hence, column 
rank — row rank. 

Similarly (or considering the transpose matrix A*) we obtain row rank — column rank. Thus 
the row rank and column rank are equal. 



BASIS AND DIMENSION 

5.22. Determine whether or not the following form a basis for the vector space R 3 : 

(i) (1, 1, 1) and (1, -1, 5) (iii) (1, 1, 1), (1, 2, 3) and (2, -1, 1) 

(ii) (1, 2, 3), (1, 0, -1), (3, -1, 0) (iv) (1, 1, 2), (1, 2, 5) and (5, 3, 4) 
and (2, 1, -2) 

(i) and (ii). No; for a basis of R 3 must contain exactly 3 elements, since R 3 is of dimension 3. 

(iii) The vectors form a basis if and only if they are independent. Thus form the matrix whose 
rows are the given vectors, and row reduce to echelon form: 



1 



-1 



to 






/I 


1 




) * 


0 


1 


■) 






0 





The echelon matrix has no zero rows; hence the three vectors are independent and so form a 
basis for R 3 . 

(iv) Form the matrix whose rows are the given vectors, and row reduce to echelon form: 

1 2\ /l 1 2\ h 1 

2 5 to 0 1 3 to o 1 

3 4/ \0 -2 -6/ \o 0 

The echelon matrix has a zero row, i.e. only two nonzero rows; hence the three vectors are 
dependent and so do not form a basis for R 3 . 



5.23. Let W be the subspace of R 4 generated by the vectors (1, —2, 5, —3), (2, 3, 1, —4) and 
(3, 8, —3, —5). (i) Find a basis and the dimension of W. (ii) Extend the basis of W 
to a basis of the whole space R 4 . 

(i) Form the matrix whose rows are the given vectors, and row reduce to echelon form: 

1-2 5 -3\ /l -2 5 3\ /l -2 5 -s\ 

2 3 1 -4 to 0 7 -9 2 to 0 7-9 2 

3 8-3 -5/ \0 14 -18 4/ \0 0 0 0/ 

The nonzero rows (1, —2, 5, —3) and (0, 7, —9, 2) of the echelon matrix form a basis of the row 
space, that is, of W. Thus, in particular, dim W = 2. 

(ii) We seek four independent vectors which include the above two vectors. The vectors (1, —2, 5, —3), 
(0, 7, —9, 2), (0, 0, 1, 0) and (0, 0, 0, 1) are independent (since they form an echelon matrix), and 
so they form a basis of R 4 which is an extension of the basis of W. 



5.24. Let W be the space generated by the polynomials 

vi = t s - 2t 2 + At + 1 v 3 = t 3 + 6t - 5 
v 2 = 2t 3 - St 2 + 9t - 1 Vi = 2t 3 - 5t 2 + It + 5 
Find a basis and the dimension of W. 

The coordinate vectors of the given polynomials relative to the basis {f 3 , t 2 , t, 1} are respectively 
[«J = (1,-2,4,1) [v s ] = (1,0,6,-5) 

[v 2 ] = (2,-3,9,-1) [v 4 ] = (2,-5,7,5) 
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Form the matrix whose rows are the above coordinate vectors, and row reduce to echelon form: 

4 



/I -2 

2 -3 

1 0 

2 -5 



A 
-1 

-5 

5, 



to 




to 



-2 
1 
0 
0 



The nonzero rows (1,-2,4,1) and (0,1,1,-3) of the echelon matrix form a basis of the space 
generated by the coordinate vectors, and so the corresponding polynomials 

& - 2t 2 + 4t + 1 and t 2 + t - 3 



form a basis of W. Thus dim W = 2. 



5.25. Find the dimension and a basis of the solution space W of the system 

x + 2y + 2z - s + St = 0 
x + 2y + 3z + 8 + t = 0 
3a; + 6y + 8z + s + 5t = 0 

Reduce the system to echelon form: 

x + 2y + 2z - s + 3t = 0 

x + 2y + 2z - s + St = 0 

z + 2s — 2t = 0 or 

z + 2s - 2t = 0 

2z + 4s - U = 0 

The system in echelon form has 2 (nonzero) equations in 5 unknowns; hence the dimension of the 
solution space W is 5 — 2 = 3. The free variables are y, s and t. Set 

(i) y = 1, 8 = 0, t = 0, (ii) j/ = 0, s = 1, t = 0, (iii) 2/ = 0, s = 0, t = 1 

to obtain the respective solutions 

Vl = (-2,1,0,0,0), v 2 = (5,0,-2,1,0), v 3 = (-7,0,2,0,1) 

The set {v u v 2 , v 3 } is a basis of the solution space W. 

5.26. Find a homogeneous system whose solution set W is generated by 

{(1, -2, 0, 3), (1, -1, -1, 4), (1, 0, -2, 5)} 

Method 1. Let v = (x, y, z, w). Form the matrix M whose first rows are the given vectors and whose 
last row is v; and then row reduce to echelon form: 

ll -2 0 3 

0 1-1 1 
0 0 2x + y + z —5x — y + w 
\0 0 0 0 

The original first three rows show that W has dimension 2. Thus v&W if and only if the addi- 
tional row does not increase the dimension of the row space. Hence we set the last two entries 
in the third row on the right equal to 0 to obtain the required homogeneous system 

2x + y + z =0 

5x + y —w = 0 

Method 2. We know that v = (x,y,z,w) S W if and only if v is a linear combination of the gen- 
erators of W: ,„ „ _ v 
(*, y, z, w) = r(l, -2, 0, 3) + s(l, -1, -1, 4) + t(l, 0, -2, 5) 




M = to I „ „ „ to 



The above vector equation in unknowns r, s and t is equivalent to the following system: 
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r + s + t = x r + s + t = x r + s + t = x 

-2r - s = y s + 2t = 2x + y s + 2t = 2x + y 

-s-2t = z or -s - 2i = z ° r 0 = 2x + y + z (J) 

3r + 4s + 5t = w s + 2t = w — 3x 0 = 5x + y — w 

Thus v €. W if and only if the above system has a solution, i.e. if 

2x + y + z =0 

5x + y — w = 0 

The above is the required homogeneous system. 

Remark: Observe that the augmented matrix of the system (1) is the transpose of the matrix 
M used in the first method. 



5.27. Let U and W be the following subspaces of R 4 : 

U = {(a,b,c,d): b + c + d = 0}, W = {(a,b,c,d) : a + b = 0, c ■ = 2d} 
Find the dimension and a basis of (i) U, (ii) W, (iii) UnW. 

(i) We seek a basis of the set of solutions (a, 6, c, d) of the equation 

6 + c + d = 0 or 0-a + b + e + d = 0 
The free variables are a, c and d. Set 

(J) a = 1, e = 0, d = 0, (2) a = 0, e = 1, d = 0, (*) a = 0, c = 0, d = 1 
to obtain the respective solutions 

Vl = (1,0,0,0), v 2 = (0,-1,1,0), v 3 = (0,-1,0,1) 
The set {v v v 2 , v 3 } is a basis of U, and dim [7 = 3. 

(ii) We seek a basis of the set of solutions (a, b, e, d) of the system 

a + b = 0 a + b = 0 

or 

c = 2d c - 2d = 0 

The free variables are 6 and d. Set 

(1) b = 1, d = 0, (2) 6 = 0, d = 1 
to obtain the respective solutions 

v t = (-1,1,0,0), v 2 = (0,0,2,1) 

The set {v lt v 2 } is a basis of W, and dim W = 2. 

(iii) UnW consists of those vectors (a, 6, c, d) which satisfy the conditions denning U and the con- 
ditions defining W, i.e. the three equations 

6+c+d=0 a+b =0 

o+6 =0 or & + c+d = 0 

c = 2d c - 2d = 0 

The free variable is d. Set d = 1 to obtain the solution v = (3, -3, 2, 1). Thus {v} is a basis 
of UnW, and dim(E7nW0 = l. 



5.28. Find the dimension of the vector space spanned by: 

1 2 



(i) (1, -2,3, -1) and (1, 1, -2, 3) (v) ^ g J and ^ g 

(ii) (3, -6, 3, -9) and (-2, 4, -2, 6) / 1 l\ /-3 -3 

(iii) t 3 + 2t 2 + Bt + l and 2£ 3 + 4t 2 + 6t + 2 ^ -l 

(iv) t 3 -2t 2 + 5 and t 2 + 3t - 4 (vii) 3 and -3 
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Two nonzero vectors span a space W of dimension 2 if they are independent, and of dimension 
1 if they are dependent. Recall that two vectors are dependent if and only if one is a multiple of 
the other. Hence: (i) 2, (ii) 1, (iii) 1, (iv) 2, (v) 2, (vi) 1, (vii) 1. 

5.29. Let V be the vector space of 2 by 2 symmetric matrices over K. Show that 
dim 7 = 3. (Recall that A = (a i} ) is symmetric iff A = A* or, equivalently, ay = a j{ .) 



An arbitrary 2 by 2 symmetric matrix is of the form A = 
(Note that there are three "variables".) Setting 




where a, 6, c G K. 



(i) a = 1, 6 = 0, c = 0, (ii) a = 0, b = 1, e = 0, (iii) a = 0, 6 = 0, c = 1 
we obtain the respective matrices 

-■■(:»• *-GD- --G!) 

We show that B 2 , £? 3 } is a basis of V, that is, that it (1) generates V and (2) is independent. 

(1) For the above arbitrary matrix A in V, we have 

A = ^ bS j = aE x + bE 2 + cE s 

Thus {E lt E 2 , E s } generates V. 

(2) Suppose xEi + yE 2 + zE 3 = 0, where x, y, z are unknown scalars. That is, suppose 

-G -go - (»°°) - gd-g:) 

Setting corresponding entries equal to each other, we obtain * = 0, y = 0, z = 0. In other words, 

xEi + yE 2 + zE s = 0 implies « = 0, y = 0, z = 0 
Accordingly, {E U E 2 ,E S } is independent. 

Thus {E lt E 2 , E 3 } is a basis of V and so the dimension of V is 3. 

5.30. Let V be the space of polynomials in t of degree ^ n. Show that each of the following 
is a basis of V: 

(i) {1, t, t\ . . . , *-\ t"}, (ii) {1, 1 - 1, (1 - 1) 2 , . . . , (1 - 1)-" 1 , (1 - 1)"}. 
Thus dim V = n + 1. 

(i) Clearly each polynomial in V is a linear combination of \,t, 1 and t». Furthermore, 
1, t, . . ., f _1 and t» are independent since none is a linear combination of the preceding poly- 
nomials. Thus {1, t #»} is a basis of V. 

(ii) (Note that by (i), dim V = ra+1; and so any n+ 1 independent polynomials form a basis of 

V.) Now each polynomial in the sequence 1, 1 - 1 (1 - 1)» is of degree higher than the 

preceding ones and so is not a linear combination of the preceding ones. Thus the n + 1 poly- 
nomials 1, 1 — t, . . . , (1 — t) n are independent and so form a basis of V. 

5.31. Let V be the vector space of ordered pairs of complex numbers over the real field R 
(see Problem 4.42). Show that V is of dimension 4. 

We claim that the following is a basis of V: 

B = {(1, 0), (i, 0), (0, 1), (0, 0} 
Suppose v S V. Then v = {z, w) where z, w are complex numbers, and so v = (a + bi, e + di) where 
o, 6, c, d are real numbers. Then 

v = o(l, 0) + b(i, 0) + c(0, 1) + d(0, t) 

Thus £ generates V. 
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The proof is complete if we show that B is independent. Suppose 
os^l, 0) + x 2 (i, 0) + * 3 (0, 1) + x 4 (Q, i) = 0 
where x lt x 2 , x 3> x 4 G R - Tnen 

C »j + x 2 i = 0 
(*! + x 2 i, x 3 + z 4 i) = (0, 0) and so ^ 

[ *3 + a; 4 t = 0 

Accordingly x x = 0, x 2 = 0, a 3 = 0, x 4 = 0 and so £ is independent. 

5.32. Let V be the vector space of m x n matrices over a field K. Let 2£« £ V be the matrix 
with 1 as the #-entry and 0 elsewhere. Show that {E i} } is a basis of 7. Thus 
dim V = mn. 

We need to show that {£y} generates V and is independent. 

Let A = (a y ) be any matrix in V. Then A - 2 aytfy. Hence {By} generates V. 

Now suppose that 2 *y#y = 0 where the x tj are scalars. The y-entry of 2 %h e h is * n(1 

the y-entry of 0 is 0. Thus x i} = 0, i = 1, . . ., w, j = 1 n. Accordingly the matrices E tj are 

independent. 

Thus {£?y} is a basis of V. 

Remark: Viewing a vector in K n as a 1 X n matrix, we have shown by the above result that the 
usual basis denned in Example 5.3, page 88, is a basis of X" and that dim K n = rt. 



SUMS AND INTERSECTIONS 

5.33. Suppose U and W are distinct 4-dimensional subspaces of a vector space V of dimen- 
sion 6. Find the possible dimensions of UriW. 

Since U and W are distinct, U+W properly contains U and W; hence dim (17 + W) > 4. 
But dim. (U+W) cannot be greater than 6, since dimV = 6. Hence we have two possibilities: 
(i) dim (U+W) = 5, or (ii) dim (U + W) = 6. Using Theorem 5.8 that dim (17 + W) = dim U + 
dim W - dim (Un W), we obtain 

(i) 5 = 4 + 4 -dim(t/nW) or dim(J7nW) = 3 

(ii) 6 = 4 + 4 - dim(l7nW0 or dim(UnW) = 2 
That is, the dimension of 17 n W must be either 2 or 3. 



5.34. Let U and W be the subspaces of R 4 generated by 

{(1, 1, 0, -1), (1, 2, 3, 0), (2, 3, 3, -1)} and {(1, 2, 2, -2), (2, 3, 2, -3), (1, 3, 4, -3)} 
respectively. Find (i) dim (C7 + W), (ii) dim (C7 n W). 

(i) £7+ W is the space spanned by all six vectors. Hence form the matrix whose rows are the 
given six vectors, and then row reduce to echelon form: 




Since the echelon matrix has three nonzero rows, dim (U + W) — B. 
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(ii) First find dim U and dim W. Form the two matrices whose rows are the generators of U and 
W respectively and then row reduce each to echelon form: 



and 




to 



to 




to 



to 




Since each of the echelon matrices has two nonzero rows, dim U = 2 and dim W = 2. Using 
Theorem 5.8 that dim (U+W) = dim U + dim W - dim (Z7n W), we have 

3 = 2 + 2 - dim(UnW) or dim(Ur\W) = 1 



5.35. Let U be the subspace of R 5 generated by 

{(1, 3, -2, 2, 3), (1, 4, -3, 4, 2), (2, 3, -1, -2, 9)} 
and let W be the subspace generated by 

{(1, 3, 0, 2, 1), (1, 5, -6, 6, 3), (2, 5, 3, 2, 1)} 
Find a basis and the dimension of (i) U + W, (ii) U n W. 

(i) £7 + is the space generated by all six vectors. Hence form the matrix whose rows are the 
six vectors and then row reduce to echelon form: 



to 



to 




to 




The set of nonzero rows of the echelon matrix, 

{(1, 3, -2, 2, 3), (0, 1, -1, 2, -1), (0, 0, 2, 0, -2)} 
is a basis of U+W; thus dim {U + W) = 3. 

(ii) First find homogeneous systems whose solution sets are U and W respectively. Form the matrix 
whose first rows are the generators of U and whose last row is (x, y, z, s, t) and then row reduce 
to echelon form: 




3 


-2 


2 


3 


1 


-1 


2 


-1 


-3 


3 


-6 


3 


3a; + 


y 2x + z 


-2x + s 


-3a; + t 




2 


-I ^ 






2 






4a; - 


2y + s - 


-6x + y + t 


I 




0 


0 / 





CHAP. 5] 



BASIS AND DIMENSION 



109 



Set the entries of the third row equal to 0 to obtain the homogeneous system whose solution 
set is U: 

-x + y + z = 0, Ax - 2y + s = 0, -6x + y + t = 0 

Now form the matrix whose first rows are the generators of W and whose last row is 
(x, y, z, s, t) and then row reduce to echelon form: 




/I 
0 

0 

\o 



to 



3 
2 
-1 
-3* + y 

2 




0 -9x + 3y + z Ax - 2y + s 2x - y + t 
0 0 0 0 



Set the entries of the third row equal to 0 to obtain the homogeneous system whose solution 
set is W' 

-9x + Sy + z = 0, Ax — 2y + a = 0, 2x - y + t = 0 

Combining both systems, we obtain the homogeneous system whose solution set is V n W: 

-x + y + z =0 

2y + 4z + s =0 

-5y - 6z + t = 0 

-Gy - 8z =0 

2y + 4z + « =0 

y + 2z + t = 0 



—x + y+z =0 
2y + 4z+a =0 
° r 8z + 5s + 2t = 0 

s - 2t = 0 



— x + y + z 




0 


Ax - 2y +8 




0 


-6x + y + t 




0 


' -9x + 3y + z 




0 


Ax — 2y +8 




0 


2x- y + t 




0 


—x + y+ z 




0 


2y + Az + s 




0 


8z + 5s + 2t 




0 


4z + 38 




0 


8 - 2t 




0 



There is one free variable, which is t; hence dim(UnW) = 1. Setting t = 2, we obtain the 
solution * = 1, y = 4, z = -3, 8 = 4, t = 2. Thus {(1, 4, -3, A, 2)} is a basis of 17 n W. 



COORDINATE VECTORS 

5.36. Find the coordinate vector of v relative to the basis {(1, 1, 1), (1, 1, 0), (1, 0, 0)} of R 3 
where (i) v = (4, -3, 2), (ii) v = (a, b, c). 

In each case set v as a linear combination of the basis vectors using unknown scalars x, y and z: 

v = 1, 1) + y(l, 1, 0) + z(l, 0, 0) 

and then solve for the solution vector (x,y,z). (The solution is unique since the basis vectors are 
linearly independent.) 

(i) (4,-3,2) = *(1, 1, 1) + 1/(1, 1, 0) + z(l, 0, 0) 

= (*, x, x) + {y, y, 0) + (z, 0, 0) 

= (x + y + z, x + y,x) 

Set corresponding components equal to each other to obtain the system 

x + y + z — A, x + y = —3, x = 2 

Substitute x = 2 into the second equation to obtain y = —5; then put x - 2, y = — 5 into 
the first equation to obtain z = 7. Thus * = 2, y = -5, z = 7 is the unique solution to the 
system and so the coordinate vector of v relative to the given basis is [v] = (2, —5, 7). 
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(ii) (0,6,0)'.= *(1, 1, 1) + y(l, 1, 0) + 0, 0) = (x + y + z, x + y, x) 
Then x + y+ z = a, x + y = b, x — c 

from which x = c, y - b — c, z — a—b. Thus [v] = (c,b — c,a- b), that is, [(a, 6, c)] = 
(c, 6 — c, a — 6). 

5.37. Let V be the vector space of 2 x 2 matrices over R. Find the coordinate vector of the 
matrix A G V relative to the basis 

Set A as a linear combination of the matrices in the basis using unknown scalars x, y, z, w: 

* - G-?) - ■(; :) + »g + *(J "J) + "(5 0) 

- C :)♦ ci) + (:-.). + C :) 

_ /x + z + ttf ac — j/ — z\ 

V * + y * / 

Set corresponding entries equal to each other to obtain the system 

* + z + w = 2, x — y — z = 3, x + y — 4, £ = — 7 

from which a; = -7, j/ = 11, z = -21, m> = 30. Thus [A] = (-7, 11, -21, 30). (Note that the co- 
ordinate vector of A must be a vector in R 4 since dim V = 4.) 



5.38. Let W be the vector space of 2 x 2 symmetric matrices over R. (See Problem 5.29.) 

4 -11 
-11 -7 



4 —11 1 

Find the coordinate vector of the matrix A - I _ „ ) relative to the basis 




Set A as a linear combination of the matrices in the basis using unknown scalars x, y and z: 
/ 4 -11\ / 1 -2\ _, /2 1\ , / 4 —IN fx + 2y + Az -2x + y - z\ 

A = (-n - 7 j = x {-2 i) + y {i 3 j + z U -5) = (,-*.+»-. .+«,-«.; 

Set corresponding entries equal to each other to obtain the equivalent system of linear equations 
and reduce to echelon form: 

x + 2v + 4z = 4 

* x + 2y + 4z = 4 x + 2^/ + 4z = 4 

-2*+ y- 2 - -11 ^ 5j / + 7z = -3 or 5y + 7z = -3 

—2x+ y — z = —11 „ co co 

j/ — 9z = —11 52z = 52 

x + Sy - 5z = -7 

We obtain z = 1 from the third equation, then y — — 2 from the second equation, and then x = 4 
from the first equation. Thus the solution of the system is x = 4, j/ = —2, as = 1; hence [A] = 
(4, —2, 1). (Since dim W = 3 by Problem 5.29, the coordinate vector of A must be a vector in K 3 .) 



5.39. Let {ei, e 2 , e 3 } and /z, /s} be bases of a vector space 7 (of dimension 3). Suppose 

ei = ai/i + 02/2 + 03/3 

e 2 = + O2/2 + 03/3 (*) 

e3 = C1/1 + C2/2 + C3/3 

Let P be the matrix whose rows are the coordinate vectors of ei, e* and e 3 respectively, 
relative to the basis {/i}: 
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Show that, for any vector v GV, [v] e P = [v] f . That is, multiplying the coordinate 
vector of v relative to the basis {ei) by the matrix P, we obtain the coordinate vector 
of v relative to the basis {/j}. (The matrix P is frequently called the change of basis 
matrix.) 

Suppose v = re 1 + se 2 + te 3 ; then [v] e — (r,s,t). Using (1), we have 

v = r(a 1 f 1 + a 2 f 2 + a 3 f 3 ) + s(b 1 f 1 + b 2 f 2 + 63/3) + HfiJi + ej 2 + C3/3) 
= (ra x + s&! + tci)/ t + (ra 2 + sb 2 + te 2 )/ 2 + (ra 3 + sb 3 + tc 3 )f 3 
Hence [v]f — (ra x + s&j + te u ra 2 + sb 2 + tc 2 , ra 3 + sb 3 + tc 3 ) 

On the other hand, 1 ^ Gj > 

H e P = (r, a, t) &! 6 2 6 3 

\ "2 c 3 

= (ra! + s&! + tc 1( ro 2 + s& 2 + tc 2 , ro 3 + sb 3 + tc 3 ) 

Accordingly, [v] e P — [v] f . 

Remark: In Chapter 8 we shall write coordinate vectors as column vectors rather than row 
vectors. Then, by above, 

' ra t + sb 1 + tc^ 





/<ll 


6l 




fr\ 




QMe = 


a 2 












\«3 











ra 2 + s6 2 + tc 2 
ra 3 + sb 3 + tc 3 / 



where Q is the matrix whose columns are the coordinate vectors of e v e 2 and e s respectively, relative 
to the basis {/J. Note that Q is the transpose of P and that Q appears on the left of the column 
vector [v] e whereas P appears on the right of the row vector [v] e . 



RANK OF A MATRIX 

5.40. Find the rank of the matrix A where: 



(i) A = 



1 3 1-2 -3\ 

1 4 3-1-4 

2 3-4-7 -3 
,3 8 1-7 -8/ 



(ii) A = 



! 1 


2 




2 


1 




-2 


-1 


1 


\-i 


4 





(iii) A = 



I 1 3\ 
0 -2 
5 -1 



(i) Row reduce to echelon form: 



fl 


3 


1 


-2 


~ 3 \ 


1 


4 


3 


-1 


-4 


2 


3 


-4 


-7 


-3 


\3 


8 


1 


-7 


-8/ 



to 




to 



Since the echelon matrix has two nonzero rows, rank (A) = 2. 



/I 
0 
0 



1 -2 -3\ 

2 1-1 
0 0 0 
0 0 0i 



(ii) Since row rank equals column rank, it is easier to form the transpose of A and then row 
reduce to echelon form: 




to 




to 




Thus rank (A) = 3. 
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(iii) The two columns are linearly independent since one is not a multiple of the other. Hence 
rank (A) = 2. 



5.41. Let A and B be arbitrary matrices for which the product AB is denned. Show that 
rank (AB) ^ rank (J?) and rank (AB) — rank (A). 

By Problem 4.33, page 80, the row space of AB is contained in the row space of B; hence 
rank (AB) — rank (B). Furthermore, by Problem 4.71, page 84, the column space of AB is contained 
in the column space of A; hence rank (AB) — rank (A). 



5.42. Let A be an «-square matrix. Show that A is invertible if and only if rank (A) = n. 

Note that the rows of the %-square identity matrix /„ are linearly independent since /„ is in 
echelon form; hence rank (7„) = n. Now if A is invertible then, by Problem 3.36, page 57, A is row 
equivalent to /„; hence rank (A) = n. But if A is not invertible then A is row equivalent to a matrix 
with a zero row; hence rank (A) < n. That is, A is invertible if and only if rank (A) = n. 



5.43. Let Xi v x^, . . . , x ik be the free variables of a homogeneous system of linear equations 
with n unknowns. Let Vj be the solution for which: x tj — 1, and all other free varia- 
bles = 0. Show that the solutions v\, v 2 , . . . , v k are linearly independent. 

Let A be the matrix whose rows are the v { respectively. We interchange column 1 and column 
i u then column 2 and column i 2 , . . ., and then column k and column i k ; and obtain the kXn matrix 

/l 0 0 ... 0 0 c 1>k + 1 ... c lB \ 
B _ (/C) _ 0 1 0 ... 0 0 c 2>fc + 1 ... c 2n 

\0 0 0 ... 0 1 c kik + 1 ... c k J 

The above matrix B is in echelon form and so its rows are independent; hence rank (B) = k. Since 
A and B are column equivalent, they have the same rank, i.e. rank (A) = k. But A has k rows; 
hence these rows, i.e. the v { , are linearly independent as claimed. 



MISCELLANEOUS PROBLEMS 

5.44. The concept of linear dependence is extended to every set of vectors, finite or infinite, 
as follows: the set of vectors A = {vt} is linearly dependent iff there exist vectors 
Vi v . . .,Vi n G A and scalars Oi, . . . , On S K, not all of them 0, such that 

aiVi x + a 2 Vi 2 + ■ ■ ■ + a„Vi n = 0 

Otherwise A is said to be linearly independent. Suppose that A\, A2, . . . are linearly 
independent sets of vectors, and that AiCA 2 C---. Show that the union A = 
A1UA2U ■ ■ ■ is also linearly independent. 

Suppose A is linearly dependent. Then there exist vectors v n G A and scalars 

o„ G K, not all of them 0, such that 

a l v l + a 2 v 2 + • • • + a n v n = 0 (1) 

Since A = L>A t and the v { £ A, there exist sets A { , . . ., A^ such that 

Let k be the maximum index of the sets A,.: k — max (i u . .., i n ). It follows then, since A t C A 2 c • ■ • , 
that each A { . is contained in A k . Hence v v v 2 , ...,«,£A t and so, by (1), A k is linearly dependent, 
which contradicts our hypothesis. Thus A is linearly independent. 
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5.45. Consider a finite sequence of vectors S— {vi,v 2 , . . .,v n ). Let T be the sequence of 
vectors obtained from S by one of the following "elementary operations": (i) inter- 
change two vectors, (ii) multiply a vector by a nonzero scalar, (iii) add a multiple of 
one vector to another. Show that S and T generate the same space W . Also show 
that T is independent if and only if S is independent. 

Observe that, for each operation, the vectors in T are linear combinations of vectors in S. On 
the other hand, each operation has an inverse of the same type (Prove!); hence the vectors in S are 
linear combinations of vectors in T. Thus S and T generate the same space W. Also, T is inde- 
pendent if and only if dim W — n, and this is true iff S is also independent. 

5.46. Let A — (ay) and B = (&«) be row equivalent m x n matrices over a field K, and let 
Vi, . . . , v n be any vectors in a vector space V over K. Let 

Ml — auVi + di 2 V 2 + • • ■ + OlnVn W\ = buVi + bi 2 V 2 + ■ ■ ■ + binVn 

u 2 - a 2 iVi + a 22 V2 + • • • + a 2n v n w 2 = b 2i vi + b 22 v 2 + ■ ■ ■ + b 2 „v n 

Um - OmlVl + a m2 V 2 + ■ • • + (LmnVn Wm = b m \Vi + b m2 V 2 + • • • + b mn V n 

Show that {uu) and {wi} generate the same space. 

Applying an "elementary operation" of the preceding problem to {ttj is equivalent to applying 
an elementary row operation to the matrix A. Since A and B are row equivalent, B can be obtained 
from A by a sequence of elementary row operations; hence {wj can be obtained from {mJ by the 
corresponding sequence of operations. Accordingly, {mJ and {wj generate the same space. 

5.47. Let vi, . . .,v n belong to a vector space V over a field K. Let 

wi = auvi + auv 2 + ■ • • + ainVn 
w 2 = a 2 iVi + 02202 + • • • + a 2n v n 



Wn = OnlVi + CU2V2 + • • • + a„„V„ 

where a« G K. Let P be the n-square matrix of coefficients, i.e. let P = (an). 

(i) Suppose P is invertible. Show that {Wi} and {Vi} generate the same space; hence 
{Wi} is independent if and only if {Vi} is independent. 

(ii) Suppose P is not invertible. Show that {wi) is dependent. 

(iii) Suppose {Wi} is independent. Show that P is invertible. 

(i) Since P is invertible, it is row equivalent to the identity matrix /. Hence by the preceding 
problem {wj and {v { } generate the same space. Thus one is independent if and only if the 
other is. 

(ii) Since P is not invertible, it is row equivalent to a matrix with a zero row. This means that 

generates a space which has a generating set of less than n elements. Thus {wj is 
dependent. 

(iii) This is the contrapositive of the statement of (ii), and so it follows from (ii). 

5.48. Suppose V is the direct sum of its subspaces U and W, i.e. V = U © W. Show that: 
(i) if {Mi, . . .,Um}dU and {w u . . .,w n }cW are independent, then {iM.Wj) is also 
independent; (ii) dim V = dim U + dim W. 

(i) Suppose + • • • + a m u m + Ojitij + • • • + b„w n = 0, where a it b s are scalars. Then 
0 = (etjtt! + 1- a m u m ) + H 1- b n w n ) = 0 + 0 
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where 0, + • ■ • + a m u m G U and 0, b l w 1 + • • • + b n w n G W. Since such a sum for 0 is 
unique, this leads to 

Oj% + • • • + a m u m = 0, + • • • + 6„w n - 0 

The independence of the m 4 implies that the a ; are all 0, and the independence of the w s implies 
that the bj are all 0. Consequently, {u^Wj} is independent. 

(ii) Method 1. Since V = U © W, we have V = U+W and UnW = {0}. Thus, by Theorem 
5.8, page 90, 

dirnF = dim U + dim W - dim (UnW) = dim U + dim W - 0 = dim U + dim W 

Method 2. Suppose {u u . . . , u r } and {w u . . ., w s } are bases of U and W respectively. Since 
they generate U and W respectively, {u it it> 3 } generates V = V + W. On the other hand, by 
(i), {mj, v>j} is independent. Thus {u u w } } is a basis of V; hence dim V = dim U + dim W. 



5.49. Let U be a subspace of a vector space V of finite dimension. Show that there exists 
a subspace W of V such that F = U@W. 

Let . . . , u r } be a basis of U. Since {u-} is linearly independent, it can be extended to a 

basis of V, say, {wj u„ w t w,}. Let W be the space generated by {w lt ...,w s }. Since 

{«,,«!(} generates V, V = V + W. On the other hand, V n W = {0} (Problem 5.62). Accordingly, 

7 = J7 e w. 

5.50. Recall (page 65) that if K is a subfield of a field E (or: is an extension of K), then 
E may be viewed as a vector space over X. (i) Show that the complex field C is a 
vector space of dimension 2 over the real field E. (ii) Show that the real field R is a 
vector space of infinite dimension over the rational field Q. 

(i) We claim that {!,%} is a basis of C over R. For if t/6C, then v = a + bi = a-l + b-i 
where a, 6 G R; that is, {l,i} generates C over R. Furthermore, if x'l + yi = 0 or 
x + yi = 0, where a;,j/GR, then as = 0 and y = 0; that is, {l,i} is linearly independent 
over R. Thus {1, i} is a basis of C over R, and so C is of dimension 2 over R. 

(ii) We claim that, for any n, {1, v, v 2 , . . . , u-"} is linearly independent over Q. For suppose 
a„l + o l5 r + a 2 jr 2 + ■ ■ • + a n ir n = 0, where the a { G Q, and not all the a t are 0. Then v is a 
root of the following nonzero polynomial over Q: a 0 + + a 2 x 2 + • • • + a„x". But it can be 
shown that v is a transcendental number, i.e. that it is not a root of any nonzero polynomial 
over Q. Accordingly, the n+1 real numbers l,ir,v 2 , ...,v n are linearly independent over Q. 
Thus for any finite n, R cannot be of dimension n over Q, i.e. R is of infinite dimension over Q. 

5.51. Let K be a subfield of a field L and L a subfield of a field E: KcLcE. (Hence K is a 
subfield of E.) Suppose that E is of dimension n over L and L is of dimension m 
over K. Show that E is of dimension mn over K. 

Suppose {v v . . .,«„} is a basis of S over L and {a 1( . . .,a m } is a basis of L over X. We claim 
that {apj-. t = l, ...,m, J = 1, ...,«} is a basis of £7 over K. Note that {a^} contains mn 
elements. 

Let w be any arbitrary element in E. Since {t> lt . . ., v n } generates E over L, w is a linear com- 
bination of the v t with coefficients in L: 

w = + b 2 v 2 + • • • + b n v n , b t G L (1) 

Since {o 1( ...,tt m } generates L over X, each 6 { G L is a linear combination of the a 5 with co- 
efficients in K: 

b 1 = fcii«i + k 12 a 2 + • • ■ + k lm a m 
b 2 = fc2i a i + k 22 a 2 + • • • + k 2m a m 



b n = Kl a l + k n2 a 2 + ■■■ + Km a m 
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where k tj G K. Substituting in (i), we obtain 

w = (k u a t H + k lm a m )v 1 + (k 2l a x + • • • + k 2m a m )v 2 + • • • + (fc nl a 1 + • • • + k nm a m )v n 

= fe il<Vi + • * + k lm a m v 1 + k 2l a t v 2 + • • • + k 2m a m v 2 + • • • + k nX a 1 v n + • • • + k nm a m v n 

where k jt e K. Thus w is a linear combination of the a t Vj with coefficients in K; hence {a { Vj} gen- 
erates E over K. 

The proof is complete if we show that {a t Vj} is linearly independent over K. Suppose, for scalars 
Xji GK, 2 = 0; that is, 

(xndM + x l2 a 2 v x + •■■ + x^dnVj + • • - + (x^a^ + x n2 a 2 v n + ■■■ + x nm a m v n ) = 0 

or (x nai + x 12 a 2 + ••■+ *i m a m )v 1 + • • • + (x^ + x n2 a 2 + ■■■ + x nm a m )v„ = 0 

Since {v lt ...,«„} is linearly independent over L and since the above coefficients of the v { belong to 
L, each coefficient must be 0: 

*n«i + *12<*2 + • • • + x lm a m = 0 x nl O! + x n2 a 2 + • • • + x nm a m = 0 

But {oi, . . . , a m } is linearly independent over K; hence since the x H £ K, 

*n = 0, x i2 = 0, x im = 0, » nl = 0, x n2 = 0, x nm = 0 

Accordingly, {ajVj} is linearly independent over K and the theorem is proved. 



Supplementary Problems 

LINEAR DEPENDENCE 

5.52. Determine whether w and v are linearly dependent where: 

(i) it = (1, 2, 3, 4), v = (4, 3, 2, 1) (iii) it = (0, 1), l> = (0, -3) 

(ii) u = (-1, 6, -12), v = (£, -3, 6) (iv) u = (1, 0, 0), t> = (0, 0, -3) 

« i) « "C. :)--(-JD 

(vii) it = -* 3 + £t 2 - 16, v = £i 3 - it 2 + 8 (viii) m = « 3 + 3t + 4, v = t 3 + 4t + 3 

5.53. Determine whether the following vectors in R* are linearly dependent or independent: (i) (1, 3, —1, 4), 
(3,8,-5,7), (2,9,4,23); (ii) (1,-2,4,1), (2,1,0,-3), (3,-6,1,4). 

5.54. Let V be the vector space of 2 X 3 matrices over R. Determine whether the matrices A,B,C GV 
are linearly dependent or independent where: 



5.55. Let V be the vector space of polynomials of degree — 3 over R. Determine whether it, v, w G V are 
linearly dependent or independent where: 

(i) u = t 3 - 4t 2 + 2* + 3, v = t 3 + 2t 2 + 4t - 1, w = 2t s - t 2 - Zt + 5 

(ii) u = t 3 - 5t 2 - 2t + 3, v = t s - 4t 2 - 3t + 4, w = 2* 3 - 7t 2 - It + 9 
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5.56. Let V be the vector space of functions from R into R. Show that f,g,h e V are linearly independ- 
ent where: (i) f(t) = e*, g(t) = sin t, h(t) = t 2 ; (ii) f(t) = «*, g(t) = e 2 *, hit) = t; (iii) f(t) = e«, 
#(£) = sin t, h(t) — cos f. 

5.57. Show that: (i) the vectors (1 — i, i) and (2, — 1 + i) in C 2 are linearly dependent over the complex 
field C but are linearly independent over the real field R; (ii) the vectors (3 + 1 + \/2 ) and 
(7, 1 + 2V2 ) in R 2 are linearly dependent over the real field R but are linearly independent over the 
rational field Q. 

5.58. Suppose u, v and w are linearly independent vectors. Show that: 

(i) u + v — 2w, u — v — w and u + w are linearly independent; 

(ii) u + v — 3w, u + 3d — w and v + w are linearly dependent. 

5.59. Prove or show a counterexample: If the nonzero vectors u, v and w are linearly dependent, then w 
is a linear combination of u and v. 

5.60. Suppose v v v 2 , . . . , v n are linearly independent vectors. Prove the following: 

(i) a 2 v 2 , . . . , a n v n } is linearly independent where each a { # 0. 

(ii) {v lt . . .,v i -i,w,v i+1 , . . .,v n } is linearly independent where w = b 1 v 1 + • • • + 6^ + • • • + b n v n 
and b { 0. 

5.61. Let v = (a, b) and w — (c, d) belong to K 2 . Show that {v, w} is linearly dependent if and only if 



5.62. Suppose {u u . . .,u r ,w 1 , . . .,w s } is a linearly independent subset of a vector space V. Show that 
L(ttj) n L(u>j) = {0}. (Recall that L(tt { ) is the linear span, i.e. the space generated by the u { .) 

5.63. Suppose (a n , ...,a ln ), ...,(a ml , ...,a mn ) are linearly independent vectors in K n , and suppose 
v 1 v n are linearly independent vectors in a vector space V over K. Show that the vectors 

w l = a u Vi + • • • + a ln v n , w m = a^v^ + ■ • ■ + a mn v n 

are also linearly independent. 

BASIS AND DIMENSION 

5.64. Determine whether or not each of the following forms a basis of R 2 : 

(i) (1, 1) and (3, 1) (iii) (0, 1) and (0, -3) 

(ii) (2, 1), (1, -1) and (0, 2) (iv) (2, 1) and (-3, 87) 

5.65. Determine whether or not each of the following forms a basis of R 3 : 

(i) (1, 2, -1) and (0, 3, 1) 

(ii) (2, 4, -3), (0, 1, 1) and (0, 1, -1) 

(iii) (1, 5, -6), (2, 1, 8), (3, -1, 4) and (2, 1, 1) 

(iv) (1, 3, -4), (1, 4, -3) and (2, 3, -11) 

5.66. Find a basis and the dimension of the subspace W of R 4 generated by: 

(i) (1, 4, -1, 3), (2, 1, -3, -1) and (0, 2, 1, -5) 

(ii) (1, -4, -2, 1), (1, -3, -1, 2) and (3, -8, -2, 7) 

5.67. Let V be the space of 2 X 2 matrices over R and let W be the subspace generated by 



ad — be — 0. 




and 




Find a basis and the dimension of W. 



5.68. 



Let W be the space generated by the polynomials 

u = i 3 + 2t 2 - 2t + 1, v = 1? + 3t 2 - t + 4 and w = 2t 3 + t 2 - 7t - 7 



Find a basis and the dimension of W. 
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5.69. Find a basis and the dimension of the solution space W of each homogeneous system: 

x + Zy + 2z = 0 x - 2y + 7z = 0 

x + 4y + 2z = 0 

a; + 5y + z — 0 2x + 3y - 2z = 0 

2x + « + 5z = 0 

3* + 5j/ + 8z = 0 2x - y + z = 0 

(i) (ii) (iii) 

5.70. Find a basis and the dimension of the solution space W of each homogeneous system: 

x + 2y - 2z + 2s - t = 0 x + 2y - z + 3s - 4t = 0 

x + 2y - z + 3s - 2t = 0 2a; + 4j/ - 2z - s + 5t = 0 

2x + 4y - 7z + s+ t = 0 2« + 4j/ - 2z + 4s - 2t = 0 

(i) (ii) 

5.71. Find a homogeneous system whose solution set W is generated by 

{(1, -2, 0, 3, -1), (2, -3, 2, 5, -3), (1, -2, 1, 2, -2)} 

5.72. Let V and PT be the following subspaces of R 4 : 

V = {(a.b.e.d): 6-2c + d = 0}, W = {(a.b.e.d): a = d, b = 2c} 
Find a basis and the dimension of (i) V, (ii) W, (iii) VnW. 

5.73. Let V be the vector space of polynomials in t of degree — n. Determine whether or not each of the 
following is a basis of V: 

(i) {1, 1 + t, 1 + t+t*. l + t + t 2 + £ 3 , l + t+t 2+ ... +t n-l + t n} 

(ii) {1 + t, t + P, «2 + (3, . . . , t»-2 + <n-l ( + f n}. 

SUMS AND INTERSECTIONS 

5.74. Suppose U and W are 2-dimensional subspaces of ft 3 . Show that V n W # {0>. 

5.75. Suppose C/ and PT are subspaces of V and that dim U = 4, dim T7 = 5 and dim V = 7. Find the 
possible dimensions of U nW. 

5.76. Let U and be subspaces of R 3 for which dim U - 1, dim W = 2 and U<tW. Show that 
R8 = [7 0 W. 

5.77. Let U be the subspace of R 5 generated by 

{(1, 3, -3, -1, -4), (1, 4, -1, -2, -2), (2, 9, 0, -5, -2)} 
and let W be the subspace generated by 

{(1, 6, 2, -2, 3), (2, 8, -1, -6, -5), (1, 3, -1, -5, -6)} 
Find (i) dim (U + W), (ii) dim (C7n W). 

5.78. Let V be the vector space of polynomials over R. Let U and W be the subspaces generated by 

{t 3 + U 2 -t + 3, t3 + 5t 2 + 5, 3t» + 10t z - 5t + 5} and {t 3 + 4t2 + 6, ts + 2t 2 -t + 5, 2t 3 + 2t 2 - 3t + 9} 
respectively. Find (i) dim (17 + W), (ii) dim (I7n W). 

5.79. Let U be the subspace of R 5 generated by 

{(1, -1, -1, -2, 0), (1, -2, -2, 0, -3), (1, -1, -2, -2, 1)} 
and let W be the subspace generated by 

{(1, -2, -3, 0, -2), (1, -1, -3, 2, -4), (1, -1, -2, 2, -5)} 

(i) Find two homogeneous systems whose solution spaces are U and W, respectively. 

(ii) Find a basis and the dimension of U n W. 
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COORDINATE VECTORS 

5.80. Consider the following basis of R 2 : {(2,1), (1,-1)}. Find the coordinate vector of "£8* relative 
to the above basis where: (i) v = (2,3); (ii) v = (4, -1), (iii) (3, -3); (iv) v = (a, b). 

5.81. In the vector space V of polynomials in t of degree ^ 3, consider the following basis: {1, 1 - t, (1 - t) 2 , 
(1 - t) 3 }. Find the coordinate vector of vBV relative to the above basis if: (i) v = 2 - 3t + t 2 + 2t 3 ; 
(ii) v - 3 - 2t - £ 2 ; (iii) v = o + ot + ct 2 + dt 3 . 

5 82 In the vector space W of 2 X 2 symmetric matrices over R, consider the following basis: 

iii). a :)■(--)} 

Find the coordinate vector of the matrix A&W relative to the above basis if: 

... * = (4 1) « '-(;:) 

5.83. Consider the following two bases of R 3 : 

{e, = (1, 1, 1), e 2 = (0, 2, 3), e 3 = (0, 2, -1)} and {/i = (1, 1, 0), / 2 = (1, -1, 0), f 3 = (0, 0, 1)} 

(i) Find the coordinate vector of v - (3,5,-2) relative to each basis: [v] e and [v] f . 

(ii) Find the matrix P whose rows are respectively the coordinate vectors of the e { relative to the 
basis {/i,/ 2 ,/s}. 

(iii) Verify that [«],P= [v] f . 

5.84. Suppose {e v ...,«„} and ...,/„} are bases of a vector space V (of dimension n). Let P be the 
matrix whose rows are respectively the coordinate vectors of the e's relative to the basis {/ s }. Prove 
that for any vector v&V, [v] e P = [v] f . (This result is proved in Problem 5.39 in the case n - 3.) 

5.85. Show that the coordinate vector of 0£V relative to any basis of V is always the zero ti-tuple 

(0,0, ...,0). 

RANK OF A MATRIX 

5.86. Find the rank of each matrix: 




5.87. 



Let A and B be arbitrary m X n matrices. Show that rank {A + B) — rank (A) + rank (B). 



5.88. Give examples of 2 X 2 matrices A and B such that: 

(i) rank (A + B) < rank (A), rank (B) (ii) rank (A + B) = rank (A) = rank (B) 

(iii) rank (A + B) > rank (A), rank (B) 

MISCELLANEOUS PROBLEMS 

5.89. Let W be the vector space of 3 X 3 symmetric matrices over K. Show that dim W = 6 by ex- 
hibiting a basis of W. (Recall that A = (a sj ) is symmetric iff a M = a }i .) 

5.90. Let W be the vector space of 3 X 3 antisymmetric matrices over K. Show that dim W = 3 by 
exhibiting a basis of W. (Recall that A = (a y ) is antisymmetric iff a« - -%.) 

5.91. Suppose dim V = n. Show that a generating set with n elements is a basis. (Compare with Theorem 
5.6(iii), page 89). 
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5.92. Let t lr t 2 , . . ., t n be symbols, and let K be any field. Let V be the set of expressions + a 2 t 2 + 
••• + a n t n where a t G K. Define addition in V by 

(ojt, + a 2 t 2 + • • • + a n tj + (fcj*! + b 2 t 2 + ■ ■ ■ + b n t n ) 

= K + &,)*„ + (a 2 + 6 2 )t 2 + ••• + (a„+6 n )f„ 

Define scalar multiplication on V by 

k(a 1 t 1 + a 2 t 2 + • ■ • + o B t„) = feo^j + fca 2 t 2 + • • ■ + ka n t n 

Show that y is a vector space over K with the above operations. Also show that {t u ...,t n } is a 
basis of V where, for i = 1, . . . , », 

tj = 0«l + • • • + Ot { _i + It, + 0< i + 1 + • • • + 0t„ 

5.93. Let V be a vector space of dimension n over a field X, and let K be a vector space of dimension m 
over a subfield F. (Hence V may also be viewed as a vector space over the subfield F.) Prove that 
the dimension of V over F is mn. 

5.94. Let £7 and W be vector spaces over the same field K , and let V be the external direct sum of U and 
W (see Problem 4.45). Let U and # be the subspaces of V defined by U - {(it, 0) : u G 17} and 
j£ = {(0, w) : w G W). 

(i) Show that 17 is isomorphic to U under the correspondence it <-> (m, 0), and that W is iso- 
morphic to W under the correspondence w <-» (0, w). 

(ii) Show that dim V = dim U + dim W. 

5.95. Suppose V = t7 ® IP. Let F be the external direct product of U and W. Show that V is isomorphic 
to V under the correspondence v = u + w <r± (u, w). 



Answers to Supplementary Problems 

5.52. (i) no, (ii) yes, (iii) yes, (iv) no, (v) yes, (vi) no, (vii) yes, (viii) no. 

5.53. (i) dependent, (ii) independent. 

5.54. (i) dependent, (ii) independent. 

5.55. (i) independent, (ii) dependent. 

5.57. (i) (2,-1 + 0 = (1 + 0(1 -t,<); (ii) (7, 1 + 2A/2) = (3- V / 2)(3 + -^2. 1 + V 7 ^). 

5.59. The statement is false. Counterexample: w = (1, 0), v — (2, 0) and w = (1, 1) in R 2 . Lemma 5.2 
requires that one of the nonzero vectors u,v,w is a linear combination of the preceding ones. In 
this case, v = 2u. 

5.64. (i) yes, (ii) no, (iii) no, (iv) yes. 

5.65. (i) no, (ii) yes, (iii) no, (iv) no. 

5.66. (i) dim W = 3, (ii) dim W = 2. 

5.67. dim W = 2. 

5.68. dim W = 2. 

5.69. (i) basis, {(7, -1, -2)}; dim W = 1. (ii) dim W = 0. (iii) basis, {(18, -1, -7)}; dim W = 1. 

5.70. (i) basis, {(2, -1, 0, 0, 0), (4, 0, 1, -1, 0), (3, 0, 1, 0, 1)}; dim W = 3. 
(ii) basis, {(2, -1, 0, 0, 0), (1, 0, 1, 0, 0)}; dim W = 2. 
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f5x + y — z — s = 0 
5.71. J 

[x+y-z-t = 0 



5.72. (i) basis, {(1,0,0,0), (0,2,1,0), (0,-1,0,1)}; dimV = 3. 

(ii) basis, {(1,0,0,1), (0,2,1,0)}; dim W = 2. 

(iii) basis, {(0,2,1,0)}; dim (VnW) = 1. Hint. VnW must satisfy all three conditions on a, b,c 
and d. 

5.73. (i) yes, (ii) no. For dim V = n + 1, but the set contains only n elements. 
5.75. dim(t/nW0 = 2, 3 or 4. 

5.77. dim (U+W) = 3, dim (UnW ) = 2. 

5.78. dim (U + W) = 3, dim (17 n W) = 1. 

5 79 (i) / to + 4 *~ a "* = ° {* X + 2 V ~ 8 = 0 
\4x + 2j/ + s = 0 ' [9a; + 2y + z + t = 0 

(ii) {(1, -2, -5, 0, 0), (0, 0, 1, 0, -1)}. dim (UnW) = 2. 

5.80. (i) [v] = (5/3, -4/3), (ii) [v] = (1, 2), (iii) [«] = (0, 3), (iv) [v] = ((a + 6)/3, (a - 2ft)/3). 

5.81. (i) [v] = (2, -5, 7, -2), (ii) [v] = (0, 4, -1, 0), (iii) [v] = (a + b + c + d, -b - 2c - 3d, e + 3d, -d). 

5.82. (i) [A] = (2, -1, 1), (ii) [A] = (3, 1, -2). 

11 0 1\ 

5.83. (i) [«], = (3, -1, 2), [v], = (4, -1, -2); (ii) P = I 1 -1 8l. 
5.86. (i) 3, (ii) 2, (iii) 3, (iv) 2. 
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5.93. Hint. The proof is identical to that given in Problem 5.48, page 113, for a special case (when V is 
an extension field of K). 
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MAPPINGS 

Let A and B be arbitrary sets. Suppose to each aGA there is assigned a unique ele- 
ment of B; the collection, /, of such assignments is called a function or mapping (or: map) 
from A into B, and is written 



f:A 



or 



B 



We write f(a), read "/ of a", for the element of B that / assigns to aGA; it is called the 
value of / at a or the imaflre of a under /. If A' is any subset of A, then f(A') denotes the set 
of images of elements of A'; and if B' is any subset of B, then /->(#') denotes the set of 
elements of A each of whose image lies in B': 

f(A>) = (f(a) : a G A'} and f~\B') = {aGA: f(a) G B'} 

We call f(A') the image of A' and f~ l (B') the inverse image or preimage of 2?'. In particular, 
the set of all images, i.e. f(A), is called the image (or: range) of /. Furthermore, A is called 
the domain of the mapping f:A-*B, and Z? is called its co-domain. 

To each mapping f -.A^B there corresponds the subset of A x B given by 
{(a, /(a)) : aGA}. We call this set the graph of /. Two mappings f:A-*B and g:A-*B 
are denned to be egwaZ, written f = g, if /(a) = g(a) for every o£A, that is, if they have 
the same graph. Thus we do not distinguish between a function and its graph. The nega- 
tion of / = g is written / ¥• g and is the statement: there exists an aGA for which 
f(a) * g(a). 



Example 6.1: 



Let A — {a,b,c,d} and B — {x,y,z,w}. The following diagram defines a mapping 
/ from A into B: 




Here f(a) = y, f(b) = x, /(c) = z, and f(d) = y. Also, 

f({a, b,d}) = U(a), f(b), f(d)} = {y,x,y} - {x,y} 
The image (or: range) of / is the set {x,y,z}: f(A) = {x,y,z}. 

Example 6.2: Let /:R -» R be the mapping which assigns to each real number x its square a; 2 : 

x H> x 2 or f(x) = x 2 
Here the image of —3 is 9 so we may write /(— 3) = 9. 
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We use the arrow h» to denote the image of an arbitrary element x G A under a mapping 
f:A-*B by writing x**f{x) 
as illustrated in the preceding example. 



If we write the vectors in R 3 and 



(\ -3 5' 

Example 6.3: Consider the 2 X 3 matrix A = f g ^ _ 1 

R 2 as column vectors, then A determines the mapping T : R 3 -» R 2 denned by 
t> h> Ai>, that is, T(v) - Av, v G R 3 



Thus if « = 



then T(v) = Av 



Remark: 



Every mxn matrix A over a field K determines the mapping T:K n -*K m 
denned by v h> A*> 

where the vectors in K » and X" 1 are written as column vectors. For convenience 
we shall usually denote the above mapping by A, the same symbol used for the 
matrix. 



Example 6.4: 



Example 6.5: 



Example 6.6: 



Let V be the vector space of polynomials in the variable t over the real field R. 
Then the derivative defines a mapping D : V -» V where, for any polynomial / G V, 
we let D(f) = df/dt. For example, D(3t* - 5t + 2) = 6t - 5. 

Let V be the vector space of polynomials in t over R (as in the preceding example). 
Then the integral from, say, 0 to 1 defines a mapping 3 : V -* R where, for any 

polynomial / G V, we let $(f) = f /(t) <**• For example, 

■'o 

<7(3*2-5t + 2) = f (3t 2 -5t + 2)ctt = 4 

Note that this map is from the vector space V into the scalar field R, whereas the 

map in the preceding example is from V into itself. 

Consider two mappings f:A-B and g : B -* C illustrated below: 

f S \ B 



0. 



Let a G A; then /(a) G fJ, the domain of g. Hence we can obtain the image of /(a) 
under the mapping g, that is, g(f(a)). This map 

o H> flr(/(o)) 

from A into C is called the composition or product of / and g, and is denoted by 
g of. In other words, (g°f):A-> C is the mapping defined by 

W°f)(a) = 0 (/(«)) 

Our first theorem tells us that composition of mappings satisfies the associative law. 
Theorem 6.1: Let f.A^B, g-.B^C and h.C^D. Then fc°(<7°/) = (h°9)°f- 
We prove this theorem now. If a G A, then 

(h°{g°f)){a) = h{(gof)(a)) = h(g(f{a))) 

and /)(<*) = (hog)(f(a)) = h(g(f(a))) 

Thus (ho(gof))(a) = ((hog)of)(a) for every a G A, and so ho(gof) = (h»g)of. 

Remark: Let F:A-»B. Some texts write aF instead of F(a) for the image of a G A 
under F. With this notation, the composition of functions F:A->B and 
G:B^C is denoted by F o G and not by G o F as used in this text. 
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We next introduce some special types of mappings. 

Definition: A mapping f:A-*B is said to be one-to-one (or one-one or 1-1) or infective 
if different elements of A have distinct images; that is, 

if o ^ o' implies f(a) ¥° f(a') 

or, equivalently, if /(a) = /(a') implies a = a' 

Definition: A mapping f:A-*B is said to be onto {or: / maps A onto B) or surjective if 
every 6 GB is the image of at least one o G A. 

A mapping which is both one-one and onto is said to be bijective. 

Example 6.7: Let / : R -» R, g : R -» R and h : R -> R be denned by f(x) = 2*, g(x) - x 3 - x and 
h(x) = * 2 . The graphs of these mappings follow: 




fix) = 2* g(x) = x 3 - x h(x) = x 2 



The mapping / is one-one; geometrically, this means that each horizontal line does 
not contain more than one point of /. The mapping g is onto; geometrically, this 
means that each horizontal line contains at least one point of g. The mapping ft 
is neither one-one nor onto; for example, 2 and —2 have the same image 4, and —16 
is not the image of any element of R. 

Example 6.8: Let A be any set. The mapping / : A -* A denned by /(&) — ct, i.e. which assigns 
to each element in A itself, is called the identity mapping on A and is denoted by 
1 A or 1 or /. 

Example 6.9: Let f:A->B. We call g:B->A the inverse of /, written if 

fog = 1 B and g°f = 1 A 

We emphasize that / has an inverse if and only if / is both one-to-one and onto 
(Problem 6.9). Also, if 6 e B then /~ l (b) = a where a is the unique element of A 
for which f{a) — 6. 



LINEAR MAPPINGS 

Let V and U be vector spaces over the same field K. A mapping F : V -* U is called a 
linear mapping (or linear transformation or vector space homomorphism) if it satisfies the 
following two conditions: 

(1) For any v, w G V, F(v + w) = F(v) + F(w). 

(2) For any k G K and any v<EV, F(kv) = kF{v). 

In other words, F:V ^ U is linear if it "preserves" the two basic operations of a vector 
space, that of vector addition and that of scalar multiplication. 

Substituting k = 0 into (2) we obtain F(0) = 0. That is, every linear mapping takes 
the zero vector into the zero vector. 
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Now for any scalars a, b G K and any vectors v, w G V we obtain, by applying both 
conditions of linearity, 

F(av + bw) = F(av) + F(bw) = aF(v) + bF(w) 

More generally, for any scalars a { GK and any vectors vi&V we obtain the basic 
property of linear mappings: 

F(aiVi + CL2V2 +■•■+ a„v n ) = aiF(vi) + a 2 F(v 2 ) + ••■ + a n F(v n ) 
We remark that the condition F(av + bw) = aF(v) + bF{w) completely characterizes 
linear mappings and is sometimes used as its definition. 

Example 6.10: Let A be any mXn matrix over a field K. As noted previously, A determines a 
mapping T : K n -* K m by the assignment v i-* Av. (Here the vectors in K n and K m 
are written as columns.) We claim that T is linear. For, by properties of matrices, 

T( v + w) = A(v + w) = Av + Aw = 2» + T(w) 
and T(kv) = A(kv) = kAv = kT(v) 

where v,w € K n and k & K. 
We comment that the above type of linear mapping shall occur again and again. In 
fact, in the next chapter we show that every linear mapping from one finite-dimensional 
vector space into another can be represented as a linear mapping of the above type. 

Example 6 11 • Let F : R 3 -* R 3 be the "projection" mapping into the xy plane: F(x, y, 2) = (*, y, 0). 
We show that F is linear. Let v = (a, b, c) and w = (a', 6', e'). Then 
F(v + w) = F(a + a', b + V, c + c') = (a + a', b + b', 0) 
= (a, b, 0) + (a', 6', 0) = F(v) + F(w) 

and, for any k S R, 

F(kv) = F(ka, kb, kc) = (feo, kb, 0) = k(a, b, 0) = kF(v) 

That is, F is linear. 

Example 612- Let F : R 2 -> R 2 be the "translation" mapping defined by F(x,y) = (a + l.y + 2). 

Observe that F(0) = F(0,0) = (1,2) * 0. That is, the zero vector is not mapped 
onto the zero vector. Hence F is not linear. 

Example 6.13: Let F : V -» U be the mapping which assigns 0 S U to every v G V. Then, for 
any v,w<EV and any k&K, we have 

F ( V + W ) = 0 = 0 + 0 = F(u) + F(w) and F(fc«) = 0 = fcO = fcF(v) 
Thus F is linear. We call F the zero mapping and shall usually denote it by 0. 

Example 6.14: Consider the identity mapping I.V^V which maps each vBV into itself. Then, 
for any v, w e V and any o, b G K, we have 

I(av + bw) = av + bw = al(v) + bl(w) 

Thus / is linear. 

Example 6.15: Let V be the vector space of polynomials in the variable t over the real field IR 
Then the differential mapping D : V - V and the integral mapping 3- V-B 
defined in Examples 6.4 and 6.5 are linear. For it is proven in calculus that for any 
u, v G V and k G R, 

<*(" + «> = and « = fcff 

dt dt dt at at 

that is, D(u + v) = D(u) + D(v) and flW=^M; and also, 

j' 1 (u(t) + v(t)) dt = f w(t) dt + f r(t) dt 
0 ■'0 ^0 

and f fc«(t)dt = fe C M(t)dt 

that is, ^(« + v) = ,5(m) + and ^(fcw) = fe^(«). 
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Example 6.16: Let F:V -» U be a linear mapping which is both one-one and onto. Then an inverse 
mapping F~* : U -» V exists. We will show (Problem 6.17) that this inverse map- 
ping is also linear. 

When we investigated the coordinates of a vector relative to a basis, we also introduced 
the notion of two spaces being isomorphic. We now give a formal definition. 

Definition: A linear mapping F : V -» U is called an isomorphism if it is one-to-one. The 
vector spaces V, U are said to be isomorphic if there is an isomorphism of 
V onto U. 

Example 6.17: Let V be a vector space over K of dimension n and let {e u . . ., e n } be a basis of V. 

Then as noted previously the mapping v t-> [v] e , i.e. which maps each v G V into 
its coordinate vector relative to the basis {e}, is an isomorphism of V onto K n . 

Our next theorem gives us an abundance of examples of linear mappings; in particular, 
it tells us that a linear mapping is completely determined by its values on the elements 
of a basis. 

Theorem 6.2: Let V and U be vector spaces over a field K. Let {vi,v 2 , ...,««} be a basis 
of V and let Ui, u%, ... , u n be any vectors in U. Then there exists a unique 
linear mapping F-.V-+U such that F(vi) = u u F(v 2 ) = Ut, F(v n ) = 

We emphasize that the vectors u h . . .,Un in the preceding theorem are completely ar- 
bitrary; they may be linearly dependent or they may even be equal to each other. 



KERNEL AND IMAGE OF A LINEAR MAPPING 

We begin by defining two concepts. 

Definition: Let F:V-*U be a linear mapping. The image of F, written Im F, is the set 
of image points in U: 

Im F = {uGU: F(v) = u for some v G V) 

The kernel of F, written KerF, is the set of elements in V which map into 
OGU: 

Ker F = {v G V : F(v) = 0} 
The following theorem is easily proven (Problem 6.22). 

Theorem 6.3: Let F:V^>U be a linear mapping. Then the image of F is a subspace 
of U and the kernel of F is a subspace of V. 



Example 6.18: Let F : R 3 -» R3 be the projection map- 
ping into the xy plane: F(x, y, z) — 
(x, y, 0). Clearly the image of F is the 
entire xy plane: 

Im F = {(a, b, 0) : a, be. R} 

Note that the kernel of F is the z axis: 

Ker^ = {(0, 0, c): c G R} 

since these points and only these points 
map into the zero vector 0 = (0, 0, 0). 



z 




v = (a, b, c) 



Fiv)=(a,b,0) 
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Theorem 5.11: The dimension of the solution space W of the homogeneous system of linear 
equations AX = 0 is n-r where n is the number of unknowns and r is the 
rank of the coefficient matrix A. 



OPERATIONS WITH LINEAR MAPPINGS 

We are able to combine linear mappings in various ways to obtain new linear mappings. 
These operations are very important and shall be used throughout the text. 

Suppose F : V -> U and G:V-*U are linear mappings of vector spaces over a field K. 
We define the sum F + G to be the mapping from V into U which assigns F(v) + G{v) to 

vGV: (F + G)(v) = F(v) + G(v) 

Furthermore, for any scalar k e K, we define the product kF to be the mapping from V 
into U which assigns kF(v) to v € V: 

(kF)(v) = kF(v) 

We show that if F and G are linear, then F + G and kF are also linear. We have, for any 

vectors v,w &V and any scalars a, b G K, 

(F + G)(av + bw) = F(av + bw) + G{av + bw) 

= aF(v) + bF(w) + aG{v) + bG{w) 
= a(F(v) + G{v)) + b(F(w) + G(w)) 
= a(F + G)(v) + b(F + G)(w) 

and (kF)(av + bw) = kF(av + bw) = k(aF(v) + bF(w)) 

= akF(v) + bkF(w) = a(kF)(v) + b(kF)(w) 

Thus F + G and kF are linear. 

The following theorem applies. 

Theorem 6.6: Let V and U be vector spaces over a field K. Then the collection of all 
linear mappings from V into U with the above operations of addition and 
scalar multiplication form a vector space over K. 

The space in the above theorem is usually denoted by 

Horn (V, U) 

Here Horn comes from the word homomorphism. In the case that V and U are of finite 
dimension, we have the following theorem. 

Theorem 6.7: Suppose dimV = m and dimU = n. Then dim Hom(V, U) = mn. 

Now suppose that V, U and W are vector spaces over the same field K, and that F:V-*U 
and G : U ■* W are linear mappings: 

Recall that the composition function GoF is the mapping from V into W defined by 
(G°F)(v) = G(F(v)). We show that GoF is linear whenever F and G are linear. We have, 
for any vectors v, w G V and any scalars a,b GK, 

(GoF)(av + bw) = G(F(av + bw)) = G(aF(v) + bF(w)) 

= aG(F(v)) + bG(F(w)) = a{G°F){v) + b(GoF)(w) 

That is, G°F is linear. 
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The composition of linear mappings and that of addition and scalar multiplication are 
related as follows: 

Theorem 6.8: Let V, U and W be vector spaces over K. Let F, F' be linear mappings from 
V into U and G, G' linear mappings from U into W, and let k G K. Then: 

(i) G°(F + F') = G<>F + G°F' 

(ii) (G + G')oF = GoF + G'oF 

(iii) k(GoF) = (kG)oF = Go(kF). 



ALGEBRA OF LINEAR OPERATORS 

Let V be a vector space over a field K. We now consider the special case of linear map- 
pings T : V -» V, i.e. from V into itself. They are also called linear operators or linear 
transformations on V. We will write A(V), instead of Horn (V, V), for the space of all such 
mappings. 

By Theorem 6.6, A(V) is a vector space over K; it is of dimension n 2 if V is of dimension 
n. Now if T,S G A(V), then the composition SoT exists and is also a linear mapping 
from V into itself, i.e. SoT G A(V). Thus we have a "multiplication" denned in A(V). 
(We shall write ST for S°T in the space A(V).) 

We remark that an algebra A over a field X is a vector space over K in which an opera- 
tion of multiplication is defined satisfying, for every F,G,HGA and every k G K, 

(i) F(G + H) = FG + FH 

(ii) (G + H)F = GF + HF 

(iii) k(GF) = (kG)F = G(kF). 

If the associative law also holds for the multiplication, i.e. if for every F,G,H G A, 

(iv) (FG)H = F(GH) 

then the algebra A is said to be associative. Thus by Theorems 6.8 and 6.1, A(V) is an 
associative algebra over K with respect to composition of mappings; hence it is frequently 
called the algebra of linear operators on V. 

Observe that the identity mapping I:V-*V belongs to A(V). Also, for any T G A(V), 
we have TI = IT = T. We note that we can also form "powers" of T; we use the notation 
T 2 = T o T, T* = T o T o T, .... Furthermore, for any polynomial 

p{x) = do + aix + a 2 x 2 + • • • + a„x n , a ; G K 

we can form the operator p(T) denned by 

p(T) = aol + axT + a 2 T 2 + • • • + OnT n 

(For a scalar k G K, the operator kl is frequently denoted by simply k.) In particular, if 
p{T) = 0, the zero mapping, then T is said to be a zero of the polynomial p(x). 

Example 6.21: Let T : R3 -* R3 b e defined by T(x,y,z) = (0,x,y). Now if (a,b,c) is any element 
of R 3 , then: 

(T + I)(a, b, e) = (0, a, b) + (a, b, c) = (a,a+b,b + c) 

and T*(a, b, c) = T 2 ®, a, b) - T(0, 0, a) = (0, 0, 0) 

Thus we see that T 3 = 0, the zero mapping from V into itself. In other words, 
T is a zero of the polynomial p{x) = x 3 . 
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INVERTIBLE OPERATORS 

A linear operator T : V -> V is said to be invertible if it has an inverse, i.e. if there 
exists T-'GAiV) such that TT' 1 = T~ 1 T = 1. 

Now T is invertible if and only if it is one-one and onto. Thus in particular, if T is 
invertible then only 0 G V can map into itself, i.e. T is nonsingular. On the other hand, 
suppose T is nonsingular, i.e. Ker T = {0}. Recall (page 127) that T is also one-one. More- 
over, assuming V has finite dimension, we have, by Theorem 6.4, 

dimV = dim(ImT) + dim (Ker T) = dim(ImT) + dim({0}) 
= dim (Im T) + 0 = dim (Im T) 

Then Im T = V, i.e. the image of T is V; thus T is onto. Hence T is both one-one and onto 
and so is invertible. We have just proven 

Theorem 6.9: A linear operator T:V-*V on a vector space of finite dimension is in- 
vertible if and only if it is nonsingular. 

Example 6.22: Let T be the operator on R 2 defined by T(x, y) = (y, 2x-y). The kernel of T is 
{(0, 0)}; hence T is nonsingular and, by the preceding theorem, invertible. We now 
find a formula for T~ l . Suppose (s, t) is the image of (x, y) under T; hence (x, y) 
is the image of (s, t) under T~h T(x, y) = (s, t) and T~Hs, t) = (x, y). We have 

T(x, y) - (y, 2x — y) = (s, t) and so V = s, 2* - y = t 

Solving for x and y in terms of s and t, we obtain x = |s + ^t, y = s. Thus T~ l 
is given by the formula T~ l {s, t) = + 8). 

The finiteness of the dimensionality of V in the preceding theorem is necessary as seen 
in the next example. 

Example 6.23: Let V be the vector space of polynomials over K, and let T be the operator on V 
defined by 

T(a 0 + a t t + ■ • • + a n t") = a 0 t + ai t z + ■ ■ ■ + a n t n + 1 

i.e. T increases the exponent of t in each term by 1. Now T is a linear mapping 
and is nonsingular. However, T is not onto and so is not invertible. 

We now give an important application of the above theorem to systems of linear 
equations over K. Consider a system with the same number of equations as unknowns, 
say n. We can represent this system by the matrix equation 

Ax = b (*) 
where A is an n-square matrix over K which we view as a linear operator on K n . Suppose 
the matrix A is nonsingular, i.e. the matrix equation Ax = 0 has only the zero solution. 
Then, by Theorem 6.9, the linear mapping A is one-to-one and onto. This means that the 
system (*) has a unique solution for any b e K n . On the other hand, suppose the matrix 
A is singular, i.e. the matrix equation Ax = 0 has a nonzero solution. Then the linear 
mapping A is not onto. This means that there exist b G K n for which (*) does not have a 
solution. Furthermore, if a solution exists it is not unique. Thus we have proven the 
following fundamental result: 

Theorem 6.10: Consider the following system of linear equations: 

auXi + ai 2 x 2 + • • • + amXn — bi 
a 2 iXi + az2X 2 + ■ ■ ■ + a 2n x n = b 2 



a„lXl + aniXi + ■ ■ ■ + annXn = bn 
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(i) If the corresponding homogeneous system has only the zero solution, 
then the above system has a unique solution for any values of the b» 

(ii) If the corresponding homogeneous system has a nonzero solution, then: 
(i) there are values for the 6 t for which the above system does not have 
a solution; (ii) whenever a solution of the above system exists, it is 
not unique. 



Solved Problems 



MAPPINGS 

6.1. State whether or not each diagram defines a mapping from A = {a, b, c) into 

B = {x,y,z}. 





(i) (ii) 

(i) No. There is nothing assigned to the element be A. 

(ii) No. Two elements, x and z, are assigned to c G A. 

(iii) Yes. 




6.2. Use a formula to define each of the following functions from R into R. 

(i) To each number let / assign its cube. 

(ii) To each number let g assign the number 5. 

(iii) To each positive number let h assign its square, and to each nonpositive number 
let h assign the number 6. 

Also, find the value of each function at 4, -2 and 0. 

(i) Since / assigns to any number x its cube x 3 , we can define / by f(x) = x 3 . Also: 

/(4) = 43 = 64, /(-2) = (-2)3 = -8, /(0) = 03 = 0 

(ii) Since g assigns 5 to any number x, we can define g by g(x) = 5. Thus the value of g at each 
number 4, —2 and 0 is 5: 

g(4) = 5, j/(-2) = 5, g(0) = 5 

(iii) Two different rules are used to define h as follows: 

fx 2 if x > 0 
h(x) = 1 

[6 if x =2 0 

Since 4 > 0, h(4) = 4 2 = 16. On the other hand, -2,0 ^ 0 and so h{-2) = 6, h(0) = 6. 
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6.3. Let A = {1,2,3,4,5} and let f :A-*A be the map- 
ping defined by the diagram on the right, (i) Find 
the image of /. (ii) Find the graph of /. 

(i) The image f(A) of the mapping / consists of all the points 
assigned to elements of A. Now only 2, 3 and 5 appear as 
the image of any elements of A; hence f(A) = {2,3, 5}. 

(ii) The graph of / consists of the ordered pairs (a,f(a)), 
where a e A. Now /(l) = 3, /(2) = 5, /(3) = 5, /(4) = 2, 
/(5) = 3; hence the graph of 

/ = {(1,3), (2, 5), (3, 5), (4, 2), (5, 3)} 




6.4. Sketch the graph of (i) f(x) = x 2 + x - 6, (ii) g(x) = x 3 - 3a; 2 - x + 3. 

Note that these are "polynomial functions". In each case set up a table of values for x and 
then find the corresponding values of f(x). Plot the points in a coordinate diagram and then draw 
a smooth continuous curve through the points. 




6.5. Let the mappings / : A -» 2? and g-.B^C be defined by the diagram 
A f B g C 




(i) Find the composition mapping (gof);A-*C. (ii) Find the image of each map- 
ping: /, g and gof. 

(i) We use the definition of the composition mapping to compute: 



to «/)(•) 


= ff(f(a)) 


= g(y) 


to°/)to) 


= g(f{b)) 


= a(x) 


to °/)(«) 


= e(f(<>)) 


= g{y) 



Observe that we arrive at the same answer if we "follow the arrows" in the diagram: 

a -» y -* t, b -» x -> s, e-* y -» t 



CHAP. 6] 



LINEAR MAPPINGS 



133 



(ii) By the diagram, the image values under the mapping / are x and y, and the image values under 
g are r, s and t; hence 

image of / = {x, y} and image of g = {r, s, t} 
By (i), the image values under the composition mapping go f ar e t and s; hence image of 
9 ° f = {», <}• Note that the images of g and g°f are different. 

6.6. Let the mappings / and g be defined by f(x) = 2x + 1 and g{x) = x 2 - 2. (i) Find 
(fl ro /)(4) and (/°fl0(4). (ii) Find formulas defining the composition mappings 
and /ogr. 

(i) /(4) = 2-4 + 1 = 9. Hence (flr°/)(4) = p(/(4)) - p-(9) = 92 - 2 = 79. 

0(4) = 42 - 2 = 14. Hence (/ o g)(4) = f(g(4)) = /{14) = 2 • 14 + 1 = 29. 

(ii) Compute the formula for g ° / as follows: 

(0 "/)(*) = flr(/(x)) = flf(2x + l) = (2a; + l) 2 -2 = 4s 2 + 4* - 1 

Observe that the same answer can be found by writing y = f(x) = 2% + 1 and z — g(y) = 
y 2 - 2, and then eliminating y: z = j/ 2 - 2 = (2x + l) 2 — 2 = 4* 2 + 4» — 1. 

(/ 0 ff)(*) = f(g(x)) = f(x 2 -2) = 2(a; 2 -2) + 1 = 2z 2 - 3. Observe that f°g*g°f. 

6.7. Let the mappings f : A -* B, g : B -» C and h.C-*D be defined by the diagram 



A f B g c h D 




Determine if each mapping (i) is one-one, (ii) is onto, (iii) has an inverse. 

(i) The mapping / : A -* B is one-one since each element of A has a different image. The mapping 
g:B->C is not one-one since * and z both map into the same element 4. The mapping h:C -* D 
is one-one. 

(ii) The mapping f :A^B is not onto since z e B is not the image of any element of A. The 
mapping g : B -» C is onto since each element of C is the image of some element of B. The 
mapping h:C^*D is also onto. 

(iii) A mapping has an inverse if and only if it is both one-one and onto. Hence only h has an 
inverse. 

6.8. Suppose f:A-*B and g:B->C; hence the composition mapping (gof):A^C exists. 
Prove the following, (i) If / and g are one-one, then gof is one-one. (ii) If / and g 
are onto, then gof is onto, (iii) If gof is one-one, then / is one-one. (iv) If gof is 
onto, then g is onto. 

(i) Suppose (g°f)(x) = (g°f)(y). Then g(f(x)) = g(J(y)). Since g is one-one, f(x) = f(y). Since/ 
is one-one, x = y. We have proven that (fir ° /)(«) = (g°f)(y) implies x = y; hence flr°/ is 
one-one. 

(ii) Suppose c G C. Since p- is onto, there exists 6GB for which jr(6) = c. Since / is onto, there 
exists a G A for which f(a) = 6. Thus (fir °/)(o) = fir(/(a)) = jf(6) = c; hence 0 ° / is onto. 

(iii) Suppose / is not one-one. Then there exists distinct elements x,yGA for which f(x) = f(y). 
Thus {g°f)(x) = g{f(x)) = g(f(y)) = (g°f)(y); hence g°f is not one-one. Accordingly it g°f 
is one-one, then / must be one-one. 

(iv) If aG A, then (g°f)(a) = fl r (/(a)) G ff(B); hence (fir °/)(A) C g(B). Suppose fir is not onto. 
Then g(B) is properly contained in C and so (g°f){A) is properly contained in C; thus g°f is 
not onto. Accordingly if fir ° / is onto, then g must be onto. 
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6.9. Prove that a mapping f:A-*B has an inverse if and only if it is one-to-one and onto. 

Suppose / has an inverse, i.e. there exists a function / -1 : £ -> A for which f~ lo f = t A a« d 
/o/-i = l B . Since 1 A is one-to-one, / is one-to-one by Problem 6.8(iii); and since 1 B is onto, / is 
onto by Problem 6.8(iv). That is, / is both one-to-one and onto. 

Now suppose / is both one-to-one and onto. Then each i)£B is the image of a unique element 

in A, say 6. Thus if f(a) = b, then a = 6; hence /(6) = 6. Now let g denote the mapping from 
a. 

B to A denned by 6 h» b. We have: 

(i) (ff ° f)(fl) - g(f(a)) = gib) - b = a, for every a e A; hence g ° / = 1 A . 

(ii) (f°g)(b) = f(g(b)) = f(b) = 6, for every b & B; hence f°g = l B . 
Accordingly, / has an inverse. Its inverse is the mapping g. 

6.10. Let / : R -» R be denned by f(x) = 2x - 3. Now / is one-to-one and onto; hence / 
has an inverse mapping Find a formula for f' 1 . 

Let y be the image of x under the mapping /: y = /(») = 2x - 3. Consequently x will be the 
image of y under the inverse mapping f~K Thus solve for x in terms of y in the above equation: 
x = (j/ + 3)/2. Then the formula denning the inverse function is /-%) = (y + 3)/2. 

LINEAR MAPPINGS 

6.11. Show that the following mappings F are linear: 

(i) F:R 2 ^R 2 denned by F(x,y) = (x + y,x). 

(ii) F:R 3 -»R denned by F(x, y, z) = 2x - By + 4z. 

(i) Let v = (a, 6) and w = (a', b'); hence 

v + w = (a + a',b + b') and kv = (ha, kb), kSR 

We have F(v) = (a + 6, a) and F(w) = (a' + b', a'). Thus 

F(y + w) = F(a + a',b+b') - (a+ a' + b + b', a+ a') 
= (a + 6, a) + (a' + a') = F(v) + F(w) 
and F(fev) = F(ka, kb) - (ka+kb,ka) = k(a+b,a) = kF{v) 

Since v, w and A; were arbitrary, F is linear. 

(ii) Let v - (a, b,c) and w = (a', b', e')\ hence 

v + w = (a + a', 6 + 6', c + c') and fo> = (ka, kb, kc), fcSE 

We have = 2a - 36 + 4c and F(w) = 2a' - 3b' + 4c'. Thus 

F(v + w) = F(o + o', 6 + 6', c + c') = 2(o + a') - 3(6 + 6') + 4(c + c') 
= (2a - 36 + 4c) + (2a' - 36' + 4c') = F(v) + F(w) 
and F(kv) - F(ka,kb,ke) = 2ka - Zkb + 4kc = /c(2a-36 + 4c) = kF(v) 

Accordingly, F is linear. 

6.12. Show that the following mappings F are not linear: 

(i) F : R 2 -> R defined by F(x, y) = xy. 

(ii) F :B?^R 3 denned by F(x, y) = (x + 1, 2y, x + y). 
(ill) F : R 3 -» R 2 defined by F(a?, 3/, z) = (|as|, 0). 

(i) Let i> = (l,2) and w = (3,4); then v + w=(4,6). 

We have F(v) = 1*2 = 2 and F(w) = 3 • 4 = 12. Hence 
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F(v + w) = F(4, 6) = 4-6 =24 # F(v) + F(w) 
Accordingly, F is not linear. 

(ii) Since F(0, 0) = (1, 0, 0) # (0, 0, 0), F cannot be linear. 

(iii) Let v = (1, 2, 3) and k = -3; hence kv = (-3, -6, -9). 

We have F(v) = (1,0) and so kF(v) = -3(1,0) = (-3, 0). Then 
F(fcv) = F(-3, -6, -9) = (3, 0) # feF(v) 

and hence F is not linear. 



6.13. Let V be the vector space of ra-square matrices over K. Let M be an arbitrary matrix 
in V. Let T : V -» V be denned by T(A) = AM + MA, where A G 7. Show that 
T is linear. 

For any A,B G V and any k G K, we have 

T(A + B) = (A + B)M + M(A + B) = AM + BM + MA + MB 
= (AM + MA) + (BM + MB) = T(A) + T(B) 
and T(kA) = (kA)M + M(kA) = lc(AM) + k(MA) = k(AM + MA) - kT(A) 

Accordingly, T is linear. 

6.14. Prove Theorem 6.2: Let V and U be vector spaces over a field K. Let {Vi, . . . , v n } be 
a basis of V and let u u . . . , u n be any arbitrary vectors in U. Then there exists a 
unique linear mapping F : V -» C7 such that = in, F(v 2 ) = Ma, . . . , F^„) = «„. 

There are three steps to the proof of the theorem: (1) Define a mapping F : V -» U such that 
F(v ( ) = w i( i = 1, . . . , n. (2) Show that F is linear. (3) Show that F is unique. 

Step (1). Let «£V. Since {> t , . . .,v n } is a basis of V, there exist unique scalars Oj, . . . ,a n G K 

for which « = + a 2 v 2 H + a„v n . We define F:V^U by F(v) = + + • • • + a„M„. 

(Since the a t are unique, the mapping F is well-defined.) Now, for i = 1, . . . , n, 

Vi = Ovj + • • • + liJj + • • • + 0v n 

Hence F(v 4 ) - 0% + • • • + lu, + • • • + 0u n = u t 

Thus the first step of the proof is complete. 

Step (2). Suppose v = a t v t + a 2 v 2 + • • • + a n v n and w = b x v x + b 2 v 2 + • • • + b n v n . Then 

v + w = (o, + b 1 )v l + (a 2 + b 2 )v 2 + ■■■ + (a n + b n )v n 

and, for any k € K, kv = ka x v t + ka 2 v 2 + • • ■ + ka n v n . By definition of the mapping F, 

F(v) = + a 2 u 2 + ■■■ + a n u n and F(w) = 6 lMl + b 2 v 2 + ■ ■ ■ + b n u n 

Hence F(v + w) = (a t + b^ + (a 2 + 6 2 )w 2 + •••+(«„ + b n )u n 

= (a lUl + a 2 u 2 + • • ■ + a n u n ) + (b lUl + b 2 u 2 +■■■ + b n u n ) 
= F(v) + F(w) 

and F(kv) = ft^ttj + a 2 v^ H + a n u n ) = kF(v) 

Thus F is linear. 

Step (S). Now suppose G : V -» U is linear and G(v t ) = u { , i = 1, . . If v = + a 2 ^ 2 + 
• • • + a n v n , then 

G(t>) = ^(a^! + a 2 v 2 + • • • + a„v„) = a^vO + a 2 G(i; 2 ) + • • • + a n G(v n ) 
= a x u-L + a 2 u 2 + • • • + a n u n = F(v) 
Since G(v) = F(v) for every v G V, G = F. Thus F is unique and the theorem is proved. 
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6.15. Let T : R 2 -» R be the linear mapping for which 

T(l,l) = 3 and T(0, 1) = -2 (1) 

(Since {(1,1), (0,1)} is a basis of R 2 , such a linear mapping exists and is unique by 
Theorem 6.2.) Find T(a, b). 

First we write (a, 6) as a linear combination of (1, 1) and (0, 1) using unknown scalars x and y: 

(a, b) = «(1, 1) + i/(0, 1) (2) 
Then (a, 6) = (x, x) + (0, y) = (x, x + y) and so x-a, x + y- b 

Solving for x and y in terms of a and 6, we obtain 

a; = a and j/ = 6 — a (3) 

Now using (1) and (2) we have 

T(a, b) = T(x(l, 1) + y(0, 1)) = *r(l, 1) + yT(0, 1) = 3x - 2y 

Finally, using (3) we have T(o, 6) = 3x - 2y = 3(a) - 2(6 - a) = 5o - 2b. 

6.16. Let T : V-» U be linear, and suppose v u . . G V have the property that their 
images T(vi), . . ., T(v n ) are linearly independent. Show that the vectors vi, . . . , v n 
are also linearly independent. 

Suppose that, for scalars a lt a n , a t v t + a 2 v 2 H h a n v n = 0. Then 

0 = T(0) = T(a x vy + a 2 v 2 +■■■+ a n v n ) = aj^j) + a 2 T(v 2 ) + ■■■ + a n T(v n ) 

Since the T(v { ) are linearly independent, all the 0^ = 0. Thus the vectors v v ...,v n are linearly 
independent. 

6.17. Suppose the linear mapping F-.V^V is one-to-one and onto. Show that the inverse 
mapping F~ l : U -* V is also linear. 

Suppose u,u' S U. Since F is one-to-one and onto, there exist unique vectors v,v' GV for 
which F(v) = u and F(v') = u'. Since F is linear, we also have 

F(v + v') = F(v) + F(v') = u + u' and F{kv) = kF(v) = few 

By definition of the inverse mapping, F~Hu) = v, F'Hu') = V, F~Hu + u') = v + v' and 
F'Hku) = kv. Then 

F-»(m + m') = v + v' = F-Hv) + F-Hu') and F-^fcw) = kv = feF _1 (w) 
and thus F _1 is linear. 

IMAGE AND KERNEL OF LINEAR MAPPINGS 

6.18. Let F : R 4 -» R 3 be the linear mapping defined by 

F(x,y,8,t) = (x-y + s + t,x + 2s-t,x + y + Bs-Zt) 

Find a basis and the dimension of the (i) image U of F, (ii) kernel W of F . 
(i) The images of the following generators of R 4 generate the image U of F: 
F(l, 0, 0, 0) = (1,1,1) F(0, 0,1,0) = (1,2,3) 

F(0, 1,0,0) = (-1,0,1) F(0, 0,0,1) = (1,-1,-3) 
Form the matrix whose rows are the generators of U and row reduce to echelon form: 

to 




1 








1 




1 






0 


1 








M 




0 




1 




0 


2 


-ll 




\o 


0 


0/ 



Thus {(1, 1, 1), (0, 1, 2)} is a basis of U; hence dim U = 2. 
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(ii) We seek the set of (x, y, s, t) such that F(x, y, s, t) = (0, 0, 0), i.e., 

F(x, y,s,t) = (x-y + s + t,x + 2s-t,x + y + 3s-3t) = (0, 0, 0) 

Set corresponding components equal to each other to form the following homogeneous system 
whose solution space is the kernel W of F: 

x — y + s + t = 0 x — y + s + t = 0 

x + 2s - t = 0 or y + s — 2t = 0 or 

y + s - 2t = 0 

x + y + 3s - 3t = 0 2y + 2s - At = 0 

The free variables are s and t; hence dim W = 2. Set 

(a) s = —1, t = 0 to obtain the solution (2, 1, —1, 0), 

(6) s = 0, t = 1 to obtain the solution (1, 2, 0, 1). 

Thus {(2, 1, -1, 0), (1, 2, 0, 1)} is a basis of W. (Observe that dim U + dim W = 2 + 2 = 4, 
which is the dimension of the domain R 4 of F.) 



6.19. Let T : R 3 -» R 3 be the linear mapping denned by 

T(x, y,z) = (x + 2y - z, y + z, x + y - 2z) 

Find a basis and the dimension of the (i) image U of T, (ii) kernel W of T. 

(i) The images of generators of R 3 generate the image U of T: 

T(l, 0, 0) = (1, 0, 1), T(0, 1, 0) = (2, 1, 1), T(0, 0, 1) = (-1, 1, -2) 
Form the matrix whose rows are the generators of U and row reduce to echelon form: 



to 0 1-1 to 






Thus {(1, 0, 1), (0, 1, -1)} is a basis of U, and so dim V = 2. 

(ii) We seek the set of (x, y, z) such that T(x,y,z) = (0,0,0), i.e., 

T(x, y,z) = (x + 2y - z, y + z, x + y - 2z) - (0, 0, 0) 

Set corresponding components equal to each other to form the homogeneous system whose 
solution space is the kernel W of T: 

x + 2y - z = 0 x + 2y - z = 0 

x + 2y - z = 0 

y + z — 0 or y + z — 0 or 

y + z = 0 

x + y — 2z = 0 —y — z — 0 

The only free variable is z; hence dim W = 1. Let 8 = 1; then y = — 1 and x = 3. Thus 
{(3, —1, 1)} is a basis of W. (Observe that dim U + dim W = 2 + 1 = 3, which is the dimen- 
sion of the domain R 3 of T.) 



6.20. Find a linear map F : W -* R 4 whose image is generated by (1, 2, 0, —4) and (2, 0, -1, -3). 
Method 1. 

Consider the usual basis of R 3 : = (1, 0, 0), e 2 = (0, 1, 0), e 3 = (0, 0, 1). Set F(e t ) = (1, 2, 0, -4), 
F(e 2 ) = (2, 0, —1, —3) and ^(63) = (0, 0, 0, 0). By Theorem 6.2, such a linear map F exists and is 
unique. Furthermore, the image of F is generated by the F(e t ); hence F has the required property. 
We find a general formula for F(x, y, z): 

F(x, y, z) = F{xe l + ye 2 + ze 3 ) = xF(e t ) + yF(e 2 ) + zF(e 3 ) 

= x(t, 2, 0, -4) + y(2, 0, -1, -3) + z(0, 0, 0, 0) 

= (x + 2y, 2x, -y, -4a; - 3j/) 
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Method 2. 

Form a 4 X 3 matrix A whose columns consist only of the given vectors; say, 

1 2 2\ 

A = 




Recall that A determines a linear map A : R s -» B 4 whose image is generated by the columns of A. 
Thus A satisfies the required condition. 

6.21. Let V be the vector space of 2 by 2 matrices over R and let M = ( * \ ) . Let 



0 3, 

F:V-*V be the linear map defined by F(A) = AM — MA. Find a basis and the 
dimension of the kernel W of F. 



We seek the set of ^* *Q such that F^* ^ = ^ ^ 

F (s t) = (s t)(o 3) ~~ (0 s)(« 



a; y 
t 



/x 2x + 3j/\ _ /» + 2s y + 2t\ 

\a 2s + 3t ) \ 3s St ) 

/-2s 2x + 2y- 2t\ _ /0 0\ 

V-2s 2s J ~ \0 0/ 



Thus 2x + 2y-2t = 0 x + y - t = 0 

or 

2s = 0 s = 0 

The free variables are y and t; hence dim W = 2. To obtain a basis of W set 
(a) j/ = —1, t = 0 to obtain the solution x — 1, y = —1, 8 = 0, t = 0; 
(6) j/ = 0, t = 1 to obtain the solution x = 1, y = 0, » = 0, t = 1. 



a basis of W. 



6.22. Prove Theorem 6.3: Let F : V -*■ U be a linear mapping. Then (i) the image of F 
is a subspace of U and (ii) the kernel of F is a subspace of V. 

(i) Since F(0) =0, 0 e Im F. Now suppose w, u' G Im F and a,b& K. Since m and w' belong to 
the image of F, there exist vectors v, v' G V such that F(v) = tt and F(d') = u'. Then 

F(av + 617') = aF(v) + bF{v') = au + bu' G Im F 

Thus the image of F is a subspace of 17. 

(ii) Since F(0) = 0, 0 G Ker F. Now suppose v, w G Ker F and «, 6 G if. Since v and w belong 
to the kernel of F, F(v) = 0 and F(w) = 0. Thus 

F(av + bw) = aF(v) + bF(w) - oO + 60 = 0 and so av + bw G KerF 
Thus the kernel of F is a subspace of V. 



6.23. Prove Theorem 6.4: Let V be of finite dimension, and let F : V -* U be a linear map- 
ping with image V and kernel W. Then dim V + dim W = dim V. 

Suppose dim V = n. Since W is a subspace of V, its dimension is finite; say, dim W = r - n. 
Thus we need prove that dim V = n — r. 
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Let {w lt w r } be a basis of W . We extend {w s > to a basis of V: 

{m'i w r ,v u ...,i; n _ r } 

Let B = {FW.FW, ...,F(v n _ r )} 

The theorem is proved if we show that B is a basis of the image U' of F. 

Proof that B generates V. Let u e 17'. Then there exists v &V such that F(v) = w. Since 
{w j( v { } generates V and since v GV, 

v — ajWx + • • • + o r io r + b 1 v 1 + • • • + & n _ r i> n _ r 
where the a„ b t are scalars. Note that F(Wj) — 0 since the w { belong to the kernel of F. Thus 

u = F(v) = FfajWj + h o^+b!^! + ■ • • + b„_ r i; n _ r ) 

= a t F(w^ + • • • + a/W + o^K) + • • • + b B _ r F(v„_ r ) 
= OjO + • • • + a r 0 + b t F(vJ + • • • + o n _ r F(i; n _ r ) 
= + • • • + & n _ r FK_ r ) 

Accordingly, the F(i; s ) generate the image of F. 

Proof that B is linearly independent. Suppose 

a^vj) + a 2 F(v 2 ) + ■■• + »,./(«„.,) = 0 

Then F(a 1 v 1 + a 2 v 2 + ■ • • + a„_ r v n _ r ) = 0 and so + • • • + a„_ r v n _ r belongs to the kernel W 
of F. Since {wj generates W, there exist scalars b lt ...,b T such that 

a^! + a 2 v 2 + ■ ■ • + a n _ r u n _ r = biW! + b 2 w 2 + • • • + b r w r 

or + • • • + a n _ r i>„_ r — i>!Wi - • ■ • - b r w r = 0 (*) 

Since {w^ v t } is a basis of V, it is linearly independent; hence the coefficients of the w { and v { in (*) 
are all 0. In particular, a y = 0, . . ., a n _ r = 0. Accordingly, the F(v t ) are linearly independent. 

Thus B is a basis of U', and so dim V = n — r and the theorem is proved. 

6.24. Suppose f:V-*U is linear with kernel W, and that f(v) = u. Show that the "coset" 
v + W = {v + w. w &W} is the preimage of u, that is, — *> + W. 

We must prove that (i) /-»(«) C1> + W and (ii) v + W C }~ l (u). We first prove (i). Suppose 
v' ef-i(u). Then f(v') = u and so f(v'- v) =/(«') - f(v) = u - « = 0, that is, - v e W. Thus 
t' = t + (»'-«)e» + IC and hence / _1 (w)Cv + W. 

Now we prove (ii). Suppose v' € v + W. Then v' = v + w where w e Since W is the 
kernel of /, f(w) = 0. Accordingly, f(v') = f(v + w) = f(v) + f(w) = f(v) + 0 = f(v) = m. Thus 
v' e /-!(") and so t+Wc 



SINGULAR AND NONSINGULAR MAPPINGS 

6.25. Suppose F:V -* U is linear and that V is of finite dimension. Show that V and the 
image of F have the same dimension if and only if F is nonsingular. Determine all 
nonsingular mappings Till*-* R 3 . 

By Theorem 6.4, dim V = dim (ImF) + dim (Ker F). Hence V and ImF have the same di- 
mension if and only if dim (Ker F) = 0 or KerF = {0}, i.e. if and only if F is nonsingular. 

Since the dimension of R 3 is less than the dimension of R 4 , so is the dimension of the image of 
T. Accordingly, no linear mapping T : R 4 -» R 3 can be nonsingular. 



6.26. Prove that a linear mapping F:V-*U is nonsingular if and only if the image of 
an independent set is independent. 

Suppose F is nonsingular and suppose {v lr ■»„} is an independent subset of V. We claim that 

the vectors F(v x ) F(v n ) are independent. Suppose aiF(Vj) + a^Fiv^ +;•••+ a n F(v n ) - 0, where 

a, e K. Since F is linear, F(a 1 v 1 + a 2 v 2 + • • • + a n vj - 0; hence 

a 1 v 1 + a 2 v 2 + • • • + a n v n e Ker F 
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But F is nonsingular, i.e. Ker F = {0}; hence a t v t + a 2 v 2 + ■ ■ • + a n v n = 0. Since the v t are linearly 
independent, all the a { are 0. Accordingly, the FfrJ are linearly independent. In other words, the 
image of the independent set {v u . . .,v n } is independent. 

On the other hand, suppose the image of any independent set is independent. If v G V is 
nonzero, then {v} is independent. Then {F(v)} is independent and so F(v) # 0. Accordingly, F is 
nonsingular. 

OPERATIONS WITH LINEAR MAPPINGS 

6.27. Let F:R 3 ^R 2 and G:R 3 ^R 2 be denned by F{x,y,z) = (2x,y + z) and G(x,y,z) = 
(x -z,y). Find formulas denning the mappings F + G,SF and 2F-5G. 

(F + G)(x, y, z) = F(x, y, z) + G(x, y, z) 

-- (2x, y + z) + (x - z, y) = (Sx - z, 2y + z) 
(3F)(x, y, z) - 3F(x, y, z) = Z(2x, y + z) = (&x, Zy + 3z) 
(2F - 5G)(x, y, z) = 2F(x, y, z) - 5G{x, y, z) = 2{2x, y + z) - h(x - z, y) 

= (Ax, 2y + 2z) + (-5x + 5z, -5y) = (-a; + 5z, -3y + 2z) 

6.28. Let F:R 3 -»R2 and G:R 2 ^R 2 be denned by F(x,y,z) = (2x,y + z) and G(x, y) = 
(y, x) . Derive formulas denning the mappings G°F and FoG. 

(G °F)(x, y, z) = G(F(x,y,z)) = G(2x,y + z) = (y + z,2x) 
The mapping F ° G is not denned since the image of G is not contained in the domain of F. 

6.29. Show: (i) the zero mapping 0, defined by 0(v) = 0 for every v e V, is the zero ele- 
ment of Horn (V, U); (ii) the negative of F G Horn (V, U) is the mapping (-1)F, i.e. 
-F=(-1)F. 

(i) Let F G Horn (V, U). Then, for every v G V, 

(F + 0)(v) = F(v) + 0(v) = F(v) + 0 = F(v) 
Since (F + 0)(v) = F(v) for every vGV, F + 0 = F. 

(ii) For every v S V, 

(F + (-l)F)(v) = F(v) + (-l)F(v) = F(v) - F(v) = 0 = 0(d) 

Since (F + (-l)F)(v) = 0(v) for every t£V, F + (-1)F = 0. Thus (— l)f is the negative 
of F. 

6.30. Show that for F U ...,F»G Horn (V, U) and a x , . . . , a« G K, and for any v G V, 

(aiFi + a 2 F 2 + • • • + a n F n ){v) = aiF^v) + a 2 F 2 (v) + ■ ■ • + a n F n (v) 

By definition of the mapping a 1 F l , (a^^v) = a 1 F 1 (t>); hence the theorem holds for n = 1. 
Thus by induction, 

(a 1 F 1 + a 2 F 2 +---+ a n F n )(v) = {a^^v) + (a 2 F 2 +■■■+ a n F n )(v) 

= a^^v) + a 2 F 2 (v) + • • • + a n F n (v) 

6.31. Let F:B?^B, 2 , G:R 3 -*R 2 and i?:R 3 ^R 2 be defined by F(x, y, z) = (x + y + z, x + y), 
G(x, y, z) = (2x + z,x + y) and H(x, y, z) = (2y, x). Show that F,G,H G Horn (R 3 , R 2 ) 
are linearly independent. 

Suppose, for scalars a, b, c G K, 

aF+bG + cH = 0 (1) 
(Here 0 is the zero mapping.) For e x = (1, 0, 0) G R 3 , we have 

(aF + bG + cH)( ej ) = aF(l, 0, 0) + 6G(1, 0, 0) + cH(l, 0, 0) 

= a(l, 1) + 6(2, 1) + c(0, 1) = (a + 2b, a + b + c) 
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and 0(ej) = (0, 0). Thus by (1), (a + 26, a + 6 + c) = (0, 0) and so 

a + 26 = 0 and a + 6 + c = 0 (2) 

Similarly for e 2 = (0, 1, 0) G R3, we have 

(aF + bG + cH)(e 2 ) = aF(0, 1, 0) + 6G(0, 1, 0) + cH(0, 1, 0) 

= o(l, 1) + 6(0, 1) + c(2, 0) = (a + 2c, a+b) = 0(e 2 ) = (0,0) 

Thus a + 2c = 0 and a + 6 = 0 (5) 

Using (2) and (5) we obtain a = 0, 6 = 0, c = 0 W) 

Since (i) implies (4), the mappings F, G and H are linearly independent. 

6.32. Prove Theorem 6.7: Suppose dim V = m and dim U = n. Then dim Horn (V, U) - mn. 

Suppose {vj v m } is a basis of V and {it 1( ...,«„} is a basis of U. By Theorem 6.2, a linear 

mapping in Horn (V, 17) is uniquely determined by arbitrarily assigning elements of U to the basis 
elements v { of V. We define 

F {j G Horn (V, CO, i = 1 m, j = 1 n 

to be the linear mapping for which F ti (vj) = it,-, and F.jK) = 0 for k ¥> i. That is, F y maps v 4 
into it, and the other v's into 0. Observe that {F tj } contains exactly mn elements; hence the theorem 
is proved if we show that it is a basis of Horn (V, U). 

Proof that {F iS } generates Horn (V, U). Let F G Horn (V, U). Suppose F(vJ = w lt F(v 2 ) = 
w 2 , . . ., F(v m ) = w m . Since w k G U, it is a linear combination of the m's; say, 

w k = a kl u 1 + a k2 u 2 + ••• + a kn u n , k = 1, . . .,m, a v G K (1) 

m n 

Consider the linear mapping G = 2 2 <tu F u- Since G is a linear combination of the F ljt the 

i=l i=l 

proof that {F iS } generates Horn (V, U) is complete if we show that F = G. 

We now compute G(v k ), k = l m. Since F {j (v k ) = 0 for k ¥• i and F kt (v k ) = u u 

m n n n 

G(v k ) = 22 a«f«(v k ) = 2 a k jF k j(v k ) = 2 o fc j«j 

i=l j = l 3 = 1 3 = 1 

= «klWl + a fc2 M 2 + ••• + «)cnWn 

Thus by (1), G(v k ) - w k for each k. But F(v k ) = w k for each k. Accordingly, by Theorem 6.2, 
F = G; hence {Fy} generates Horn (V, U). 

Proof that {F v } is linearly independent. Suppose, for scalars a y G K, 

m n 

2 2 = o 

i=l 3=1 

For i; k , fc = 1, . . .,w, 

7n n n n 

0 = OK) = 2 t .2 aii^K) = ^OkjFfcjK) = ^"kj** 

= «lcl«l + a k2W2 + ••• + «kn M n 

But the itj are linearly independent; hence for k = 1, . . .,m, we have a kl = 0, a k2 = 0, . . ., a kn = 0. 
In other words, all the a 4j = 0 and so {F y } is linearly independent. 

Thus {Ftj} is a basis of Horn (V, V); hence dim Horn (V, U) = mn. 



6.33. Prove Theorem 6.8: Let V, U and VP be vector spaces over K. Let F, F' be linear 
mappings from V into t/ and let G,G' be linear mappings from U into W; and let 
kGK. Then: (i) G°{F + F') = GoF + GoF'; (ii) (G + G')°F = GoF + G'oF; 
(iii) fe(Gof) = (W)oF = Go(fcP). 
(i) For every v G V, 
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(G°(F + F'))(t>) = G((F + F')(v)) = G(F(v) + F'(v)) 

= G(F(v)) + G(F'(v)) = (G°F)(v) + (G°F')(v) = (G ° F + G ° F')(v) 
Since (G ° (F + F')(v) = (G ° F + G ° F')(v) for every vGV, G°(F + F') = G°F + G°F'. 

(ii) For every v GV, 

((G + G')°F)(v) = (G + G')(F(v)) = G(F(v)) + G'(F(v)) 

= (GoF)(v) + (G'°F)(v) = (G ° F + G' o F){v) 
Since ((G + G')°f)(«) = (G ° F + G ° F')(v) for every v e y, (G + G')°F = G°F + G'°F. 

(iii) For every v G V, 

(k(G°F))(v) = k(G°F)(v) = k(G(F(v))) = (kG)(F(v)) = (fcG°F)(v) 

and (k(G°F))(v) = k(G°F)(v) = k(G(F(v))) = G(fcF(v)) = G((fcF)(t>)) = (G°kF)(v) 

Accordingly, k(G°F) = (kG)°F = G°(kF). (We emphasize that two mappings are shown to 
be equal by showing that they assign the same image to each point in the domain.) 



6.34. Let F-.V^U and G.U^W be linear. Hence (G o F) : V -> W is linear. Show that 
(i) rank (Gof) ^ rank G, (ii) rank (G°F) ^ rank F. 

(i) Since F(V) c U, we also have G(F(V)) c G(I7) and so dim G(F(V)) =s dim G(?7). Then 

rank (G ° F) = dim ((G<>F)(y)) = dim (G(F(V))) ^ dim G(V) = rank G 

(ii) By Theorem 6.4, dim (G(F(V))) dimF(y). Hence 

rank (G ° F) = dim ((G<>F)(V)) = dim (G(F(V))) ^ dim F(V) = rank F 



ALGEBRA OF LINEAR OPERATORS 

6.35. Let S and T be the linear operators on R 2 defined by S(x, y) = (y, x) and T(x, y) = 
(0, x). Find formulas defining the operators S+T,2S- 3T, ST, TS, S 2 and T 2 . 

(S+T)(x,y) = S(x,y) + T(x,y) = (y, x) + (0, x) = (y,2x). 
(2S-2T)(x,y) = 2S(x,y) - ST(x,y) = 2(y,x) - Z(Q,x) = (2y,-x). 
(ST)(x,y) = S(T(x,y)) = S(0,x) = (*,0). 
(TS)(x,y) = T(S(x,y)) = T(y,x) = (0,y). 

S 2 (x, y) - S(S(x, y)) = S(y, x) = (x, y). Note S 2 = I, the identity mapping. 
T 2 (x,y) - T(T(x,y)) = T(0,x) = (0,0). Note T 2 — 0, the zero mapping. 



6.36. Let T be the linear operator on R 2 defined by 

T(3, 1) = (2, -4) and T(l, 1) = (0, 2) (1) 

(By Theorem 6.2, such a linear operator exists and is unique.) Find T(a, b). In 
particular, find T(7, 4). 

First write (a, b) as a linear combination of (3,1) and (1, 1) using unknown scalars x and y: 

(a, b) = x(3, 1) + y(l, 1) («) 

{3z + j/ = a 
x + y — b 

Solving for x and y in terms of a and 6, 

* = §a — £b and y = —la + f 6 (3) 

Now using (2), (J) and (5), 

IT(a, 6) = xT(3, 1) + »r(l, 1) = x(2, -4) + 2/(0, 2) 

= (2*, -4a;) + (0, 2y) = (2a;, -4a; + 2j/) = (a - b, 56 - 3a) 
Thus T(H, 4) = (7 - 4, 20 - 21) = (3, -1). 
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6.37. Let T be the operator on R 3 denned by T(x, y, z) = (2x, 4x -y,2x + Sy-z). (i) Show 
that T is invertible. (ii) Find a formula for T~K 

(i) The kernel W of T is the set of all (x, y, z) such that T(x, y, z) = (0, 0, 0), i.e., 

T(x,y,z) = (2x,4x-y,2x + 3y-z) = (0,0,0) 
Thus W is the solution space of the homogeneous system 

2x = 0, Ax - y = 0, 2x + Sy - z = 0 
which has only the trivial solution (0, 0, 0). Thus W = {0}; hence T is nonsingular and so by 
Theorem 6.9 is invertible. 

(ii) Let (r, s, t) be the image of (x, y, z) under T; then (x, y, z) is the image of (r, s, t) under 
T(x, y, z) = (r, s, t) and T _1 (r, s, t) = (x, y, z). We will find the values of x, y and z in terms 
of r, s and t, and then substitute in the above formula for jT -1 . From 

T(x, y, z) = (2x, 4x -y,2x + 3y-z) = (r, s, t) 

we find x = \r, y = 2r — s, 2 = 7r — 3s — t. Thus 2 1-1 is given by 

r-i(r, s, t) = (£r, 2r - s, 7r - 3s - t) 

6.38. Let V be of finite dimension and let T be a linear operator on V. Recall that 3" is 
invertible if and only if T is nonsingular or one-to-one. Show that T is invertible if 
and only if T is onto. 

By Theorem 6.4, dim V = dim (Im T) + dim (Ker T). Hence the following statements are 
equivalent: (i) T is onto, (ii) Im T = V, (iii) dim (Im T) = dim V, (iv) dim (Ker T) = 0, 
(v) Ker T = {0}, (vi) T is nonsingular, (vii) T is invertible. 

6.39. Let V be of finite dimension and let T be a linear operator on V for which TS = I, 
for some operator S on V. (We call <S a right inverse of T.) (i) Show that T is 
invertible. (ii) Show that S = T -1 . (iii) Give an example showing that the above 
need not hold if V is of infinite dimension. 

(i) Let dim V = n. By the preceding problem, T is invertible if and only if T is onto; hence T 
is invertible if and only if rank T = n. We have n = rank / = rank TS — rank T — n. 
Hence rank T = n and T is invertible. 

(ii) rr-i = r-ir = 7. Then S = IS = (T-iT)S = T-HTS) = T-iI = T-K 

(iii) Let V be the space of polynomials in * over K; say, p(t) = a 0 + a x t + a 2 t 2 + • • • + o„t™. Let T 
and S be the operators on V defined by 

T(p(t)) = 0 + «! + a 2 t + • ■ ■ + a n t"-i and S(p(t)) = o 0 t + o^ 2 + • • • + a n t n+1 
We have (TS)(p(t)) = T(S(p(t))) = T(a 0 t + + • ■ ■ + a n t^+i) 

= a„ + a,t + • • • + a n P> = p(t) 

and so TS = I, the identity mapping. On the other hand, if k S K and k ¥> 0, then (S!T)(fc) = 
S(r(fc)) = S(0) = 0 ¥■ k. Accordingly, ST # /. 

6.40. Let S and T be the linear operators on R 2 denned by S{x, y) = (0, x) and T{x, y) = 
(x, 0). Show that TS = 0 but ST # 0. Also show that T 2 = T. 

(TS)(x,y) = T(S(x,y)) = T(0,x) = (0,0). Since TS assigns 0 = (0,0) to every (»,j/)GR2, it 
is the zero mapping: TS = 0. 

(ST)(x,y) =S(T(x,y)) = S(x,0) = (0,x). For example, (ST)(4, 2) = (0, 4). Thus ST ¥> 0, since 
it does not assign 0 = (0, 0) to every element of R 2 . 

For any (x,y) € R 2 , T^(x,y) = T(T(x,y)) = T(x,0) = (x,Q) = T{x,y). Hence r 2 = T. 
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MISCELLANEOUS PROBLEMS 

6.41. Let {e u e 2 , e 3 } be a basis of V and {ft, f 2 ) a basis of U. Let T : V ■* U be linear. 
Furthermore, suppose 

T(ei) = ai/i + a 2 f 2 

T(e 2 ) = &1/1 + 62/2 and A = 
T{ez) = C1/1 + C2/2 

Show that, for any v GV, A[v] e = [T(v)] f where the vectors in K 2 and K 3 are written 
as column vectors. 

h\ 

Suppose v = k 1 e 1 + k 2 e 2 + k 3 e a ; then [v] e = \ k 2 . Also, 

w 

T(v) = kj 1 ^) + k 2 T(e 2 ) + k 3 T(e 3 ) 

= + <hf 2 ) + k 2 ihfi + 62/2) + k 3 ("ifi + C2/2) 

= + b x k 2 + c 1 k 3 )f 1 + (a 2 fci + 6 2 fc 2 + c 2 fc 3 )/ 2 

Accordmgly, [IW], = + & + 

Computing, we obtain 

ax 1 / a i 6 i c i\lfM - /Ma + M2 + "AN _ rr .„ 

= ( ft2 6a j^j - { a2kl+b2k2+eik3 ) - vw* 

6.42. Let A; be a nonzero scalar. Show that a linear map T is singular if and only if kT 
is singular. Hence T is singular if and only if — T is singular. 

Suppose T is singular. Then T(v) = 0 for some vector v ¥■ 0. Hence (kT)(v) = fe!T(t)) = fcO = 0 
and so kT is singular. 

Now suppose kT is singular. Then (kT)(w) = 0 for some vector w ¥= 0; hence T^few) = 

kT{w) = (kT){w) = 0. But k ¥= 0 and w 0 implies few ^ 0; thus T is also singular. 



6.43. Let E be a linear operator on V for which £" 2 = E. (Such an operator is termed a 
projection.) Let U be the image of E and W the kernel. Show that: (i) if u G U, 
then = u, i.e. £7 is the identity map on U; (ii) if E 1, then E is singular, i.e. 
#(1;) = 0 for some v 0; (iii) F = U®W. 

(i) If m G 17, the image of i?, then j£(d) = u for some v G V. Hence using E 2 — E, we have 

u = = £T2(v) = S<S(v)) = E(u) 

(ii) U E I then, for some v G V, = w where v ¥= u. By (i), E(u) = it. Thus 

2?(v — M ) = — .E(n) = u — u = 0 where v — w 0 

(iii) We first show that V — U + W. Let v G V. Set w = (v) and w = v - E(v). Then 

v = + « — 2?(v) — u + w 

By definition, w = E(v) G E7, the image of E. We now show that w €.W, the kernel of E: 

E(w) = E(v - E(v)) = E(v) - E 2 (v) = E(v) - E(v) = 0 

and thus w G W. Hence V — U + W. 

We next show that U n W = {0}. Let vGUnW. Since v e 17, £?(v) = v by (i). Since 
■v G tf(i>) = 0. Thus v = E(v) = 0 and so UnW - {0}. 

The above two properties imply that V = t7 0 PT. 



di Oi Ci 

a 2 &2 C2 



CHAP. 6] 



LINEAR MAPPINGS 



145 



6.44. Show that a square matrix A is invertible if and only if it is nonsingular. (Compare 
with Theorem 6.9, page 130.) 

Recall that A is invertible if and only if A is row equivalent to the identity matrix I. Thus the 
following statements are equivalent: (i) A is invertible. (ii) A and I are row equivalent, (iii) The 
equations AX = 0 and IX = 0 have the same solution space, (iv) AX = 0 has only the zero solu- 
tion, (v) A is nonsingular. 



Supplementary Problems 

MAPPINGS 

6.45. State whether each diagram defines a mapping from {1, 2, 3} into {4, 5, 6}. 




6.46. Define each of the following mappings / : R -» R by a formula: 

(i) To each number let / assign its square plus 3. 

(ii) To each number let / assign its cube plus twice the number. 

(iii) To each number — 3 let / assign the number squared, and to each number < 3 let / assign 
the number —2. 

6.47. Let /:R->R be defined by f(x) = x 2 - 4x + 3. Find (i) /(4), <ii) /(— 3), (iii) f(y - 2.x), (iv)/(*-2). 

6.48. Determine the number of different mappings from {a, 6} into {1, 2, 3}. 

6.49. Let the mapping g assign to each name in the set {Betty, Martin, David, Alan, Rebecca} the number 
of different letters needed to spell the name. Find (i) the graph of g, (ii) the image of g. 

6.50. Sketch the graph of each mapping: (i) /(*) = ^x — 1, (ii) g(x) = 2x 2 — 4x — 3. 

6.51. The mappings f:A-*B, g:B->A, h:C-*B, F-.B^C and G:A->C are illustrated in the 
diagram below. 




Determine whether each of the following defines a composition mapping and, if it does, find its 
domain and co-domain: (i) g°f, (ii) h°f, (iii) F°f, (iv) G°f, (v) g°h, (vi) h°G°g. 

6.52. Let /:R->R and g : R -» R be defined by /(*) = x 2 + Sx + 1 and g{x) = 2* - 3. Find formulas 
defining the composition mappings (i) f°g, (ii) g°f, (iii) g ° g , (iv) f°f. 

6.53. For any mapping f:A->B, show that 1 B °/ = / = /°1a- 
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6.54. For each of the following mappings / : R -> R find a formula for the inverse mapping: (i) f(x) = 
Sx - 7, (ii) /(*) = X 3 + 2. 

LINEAR MAPPINGS 

6.55. Show that the following mappings F are linear: 

(i) F : R 2 -» R 2 denned by F(x, y) = (2x - y, x). 

(ii) F : R 3 -» R 2 denned by F(x, y, z) = (z,x + y). 

(iii) F : R -» R 2 denned by F(x) = (2x, 3x). 

(iv) F : R 2 -» R 2 denned by F(«, ?/) = (ax + by, cx + dy) where a, 6, c, d e R. 

6.56. Show that the following mappings F are not linear: 

(i) F:R 2 ^R 2 defined by F(x, y) = (x 2 , y*). 

(ii) F : R 3 -» R 2 defined by F(x, y, z) = (» + 1, + z). 

(iii) F : R -» R 2 defined by F(as) = (*, 1). 

(iv) F:R 2 -»R defined by F(x, y) = \x-y\. 

6.57. Let V be the vector space of polynomials in t over K. Show that the mappings T : V -» V and 
S : V -» V defined below are linear: 

T(a a + ait+ h a n t") = a 0 t + a x t 2 + ■ ■ • + a„t» +1 

S(a 0 + Oi* + • • • + a n t») = 0 + ai + a 2 t+ ■■• + a n t^~ l 

6.58. Let V be the vector space of n x n matrices over K\ and let M be an arbitrary matrix in V. Show 
that the first two mappings T : V -» V are linear, but the third is not linear (unless M = 0): 
(i) T(A) = MA, (ii) r(A) = MA -AM, (iii) T(A) = M + A. 

6.59. Find r(a, 6) where T : R 2 -» R 3 is defined by I'd, 2) = (3, -1, 5) and r(0, 1) = (2, 1, -1). 

6.60. Find T(a, b, e) where T : R 3 -» R is defined by 

r(l, 1, 1) = 3, T(0, 1, -2) = 1 and T(0, 0, 1) = -2 

6.61. Suppose F : V -» U is linear. Show that, for any n£V, F(-v) = -F(t>). 

6.62. Let W be a subspace of V. Show that the inclusion map of W into V, denoted by i:W cV and 
defined by i(w) = w, is linear. 

KERNEL AND IMAGE OF LINEAR MAPPINGS 

6.63. For each of the following linear mappings F, find a basis and the dimension of (a) its image U 
and (6) its kernel W: 

(i) F : R 3 -» R 3 defined by F(x, y, z) = (* + 2y, y - z, x + 2z). 

(ii) F : R 2 -» R 2 defined by F(x, y) - (x + y,x + y). 

(iii) F : R 3 -» R 2 defined by F(x, y,z) = (x + y,y + z). 

6.64. Let V be the vector space of 2 X 2 matrices over R and let M = ^ _^ * ) " Let F : ^ "* V be the 

linear map defined by F(A) = MA. Find a basis and the dimension of (i) the kernel W of F and 
(ii) the image U of F. 

6.65. Find a linear mapping F : R 3 -» R 3 whose image is generated by (1, 2, 3) and (4, 5, 6). 

6.66. Find a linear mapping F : R 4 -» R 3 whose kernel is generated by (1, 2, 3, 4) and (0, 1, 1, 1). 

6.67. Let V be the vector space of polynomials in t over R. Let D : V -* V be the differential operator: 
£)(/) = df/dt. Find the kernel and image of D. 

6.68. Let F:V-*U be linear. Show that (i) the image of any subspace of V is a subspace of U and 
(ii) the preimage of any subspace of U is a subspace of V. 
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6.69. Each of the following matrices determines a linear map from K 4 into R 3 : 

/l 2 0 l\ / 1 0 2 -l\ 

(i) A = (2—1 2-1 (ii) B = 2 3-1 1 

\1 -3 2-2/ \-2 0-5 3/ 

Find a basis and the dimension of the image U and the kernel W of each map. 

6.70. Let T : C -» C be the conjugate mapping on the complex field C. That is, T(z) = z where z G C, 
or T(a +bi) = a—bi where a, b G R. (i) Show that T is not linear if C is viewed as a vector 
space over itself, (ii) Show that T is linear if C is viewed as a vector space over the real field R. 

OPERATIONS WITH LINEAR MAPPINGS 

6.71. Let F : R 3 -> R 2 and G : R 3 -» R 2 be defined by F(x, y, z) = (y,x + z) and G(x, y, z) = (2z, x - y). 
Find formulas defining the mappings F + G and 3F — 2G. 

6.72. Let H : R 2 -* R 2 be defined by H(x, y) = (y, 2x). Using the mappings F and G in the preceding 
problem, find formulas defining the mappings: (i) H°F and H°G, (ii) F°H and G°H, 
(iii)tfo(F + G) and H°F + H°G. 

6.73. Show that the following mappings F, G and H are linearly independent: 

(i) F,G,H G Horn (R 2 , R 2 ) defined by 

F(x, y) = (x, 2y), G{x, y) = (y,x + y), H(x, y) = (0, x). 

(ii) F,G,HS Horn (R 3 , R) defined by 

F(x, y, z) = x + y + z, G(x, y,z) = y + z, H(x, y,z) = x — z. 

6.74. For F,G € Horn (V, U), show that rank (F + G) — rank F + rank G. (Here V has finite 
dimension.) 

6.75. Let F :V -* U and G : U -* V be linear. Show that if F and G are nonsingular then G°F is 
nonsingular. Give an example where G°F is nonsingular but G is not. 

6.76. Prove that Horn (V, U) does satisfy all the required axioms of a vector space. That is, prove 
Theorem 6.6, page 128. 

ALGEBRA OF LINEAR OPERATORS 

6.77. Let S and T be the linear operators on R 2 defined by S{x, y) — (x + y, 0) and T(x, y) = (— y, x). 
Find formulas defining the operators S + T, 5S — 3T, ST, TS, S 2 and T*. 



6.78. Let T be the linear operator on R 2 defined by T(x, y) — {x + 2y, 3x + 4y). Find p(T) where 
p(t) = t 2 - 5t - 2. 

6.79. Show that each of the following operators T on R 3 is invertible, and find a formula for T~ l : 
(i) T(x, y, z) = (x - Sy - 2z, y - 4z, z), (ii) T(x, y, z) = {x + z,x- z, y). 

6.80. Suppose S and T are linear operators on V and that S is nonsingular. Assume V has finite dimen- 
sion. Show that rank (ST) = rank (TS) = rank T. 

6.81. Suppose V = U ® W. Let E t and E 2 be the linear operators on V defined by E t (v) = u, 
E 2 (v) = w, where v = u + w, u G U, w G W. Show that: (i) e\ = E x and e\ = E 2 , i.e. that E l 
and E 2 are "projections"; (ii) E y + E 2 = I, the identity mapping; (iii) E^E 2 = 0 and E 2 E l = 0. 

6.82. Let E l and E 2 be linear operators on V satisfying (i), (ii) and (iii) of Problem 6.81. Show that V 
is the direct sum of the image of E 1 and the image of E 2 : V = Im E 1 © Im E 2 . 

6.83. Show that if the linear operators S and T are invertible, then ST is invertible and (ST)- 1 = r -1 S -1 . 
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6.84. Let V have finite dimension, and let T be a linear operator on V such that rank (T 2 ) = rank T. 
Show that Ker T n Im T = {0}. 

MISCELLANEOUS PROBLEMS 

6.85. Suppose T : K n -* K m is a linear mapping. Let {e 1( . . . , e n } be the usual basis of K n and let A be 
the m X n matrix whose columns are the vectors T(e^j, . .., T(e n ) respectively. Show that, for every 
vector v G K n , T(v) = Av, where v is written as a column vector. 

6.86. Suppose F : V -* U is linear and k is a nonzero scalar. Show that the maps F and kF have the 
same kernel and the same image. 

6.87. Show that if F : V -* U is onto, then dim U - dim V. Determine all linear maps r : R 3 -* R 4 
which are onto. 

6.88. Find those theorems of Chapter 3 which prove that the space of «-square matrices over K is an 
associative algebra over K. 

6.89. Let T : V -* U be linear and let If be a subspace of V. The restriction of T to W is the map 
T w : W -» U defined by T w (w) = T(w), for every wGW. Prove the following, (i) T w is linear, 
(ii) Ker T w = Ker T n W. (iii) Im T w = T(W). 

6.90. Two operators S, T G A(V) are said to be similar if there exists an invertible operator P G A(V) 
for which S = P~ l TP. Prove the following, (i) Similarity of operators is an equivalence relation, 
(ii) Similar operators have the same rank (when V has finite dimension). 



Answers to Supplementary Problems 

6.45. (i) No, (ii) Yes, (iii) No. 

6.46. (i) /(*) = sc2 + 3, (ii) /(x) = x* + 2x, (iii) f(x) = \ X \ if X ~ 3 

[-2 if x < 3 

6.47. (i) 3, (ii) 24, (iii) y* - Axy + Ax 2 -Ay + Sx + 3, (iv) x 2 - 8x + 15. 

6.48. Nine. 

6.49. (i) {(Betty, 4), (Martin, 6), (David, 4), (Alan, 3), (Rebecca, 5)}. 
(ii) Image of g — {3, 4, 5, 6}. 

6.51. (i) (gof):A-*A, (ii) No, (iii) (F ° f) : A -* C, (iv) No, (v) (g°h):C^A, (vi) (h°G°g):B-*B. 

6.52. (i) (/ ° g)(x) = 4x 2 - 6x + 1 (iii) (g o g)(x) =4x-9 

(ii) (g ° f)(x) = 2x 2 + 6x - 1 (iv) (/ ° /)(«) = x* + 6a; s + 14a; 2 + 15x + 5 

6.54. (i) /-i(se) = (a: + 7)/3, (ii) f~Hx) = yfx=2. 

6.59. T(a, b) = (-a + 26, -3a + 6, 7a - 6). 

6.60. T(a, b, c) = 8a - 36 - 2c. 

6.61. F(v) + F(-v) = F(v + (-■»)) = F(0) = 0; hence F(— w) = -F(i>). 

6.63. (i) (a) {(1, 0, 1), (0, 1, -2)}, dim U = 2; (6) {(2, -1, -1)}, dim W = 1. 

(ii) (a) {(1,1)}, dim 17 = 1; (6) {(1,-1)}, dimTF=l. 

(iii) (a) {(1,0), (0,1)}, dim 17 = 2; (6) {(1, -1, 1)}, dimW = l. 
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6.64. (i) j^j o)' (o l)} baSiS ° f KerF: dim ( KerF) = 2 - 

<«) {( _2 J ) • ( o -2 )} baSiS ° f Im F: (Im F) = 2 ' 

6.65. F(a;, y, z) = (x + Ay, 2x + 5y, 3x + Gy). 

6.66. F(x, y,z,w) - (x + y-z,2x + y-w, 0). 

6.67. The kernel of D is the set of constant polynomials. The image of D is the entire space V. 

6.69. (i) (a) {(1,2,1), (0,1,1)} basis of Im A; dim(ImA) = 2. 

(6) {(4, -2, -5, 0), (1, -3, 0, 5)} basis of Ker A ; dim (Ker A) = 2. 

(ii) (a) ImB = R3; (6) {(-1,2/3,1,1)} basis of Ker B\ dim(KerB) = l. 

6.71. (F + G)(x, y, z) = (y + 2z, 2x-y + z), (SF - 2G)(x, y, z) = (Sy -Az,x + 2y + Sz). 

6.72. (i) (H a F)(x, y, z) = (x + z,2y), (H°G)(x,y,z) - (x-y,Az). (ii) Not denned. 

(iii) (H o (F + G))(x, y,z) = (H o F + H ° G)(x, y, z) = (2x - y + z, 2y + 4a). 

6.77. (S + T){x, y) = (x, x) (ST)(*, V) = (* ~V, 0) 
(5S - VT)(x, y) = (5* + 8y, -3«) (7*S)(x, j/) = (0, x + y) 

S 2 (x, y) = (x + v. 0); note that S 2 = S. 

r 2 (*. 3/) = (-*. ~V)> note that T 2 + J = 0, hence T is a zero of » 2 + 1. 

6.78. p(r) = 0. 

6.79. (i) T~Hr, s, t) = (14t + 3s + r, it + s, t), (ii) T~Hr, s, t) = ($r + t, %r - \s). 
6.87. There are no linear maps from R 3 into R 4 which are onto. 
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Matrices and Linear Operators 

INTRODUCTION 

Suppose {ei, . . . , e„} is a basis of a vector space V over a field K and, for »6F, suppose 
v = c^ei + a 2 e 2 + • • • + One*. Then the coordinate vector of v relative to {ei}, which we write 
as a column vector unless otherwise specified or implied, is 




Recall that the mapping v l-> [v] e , determined by the basis {e 4 }, is an isomorphism from V 
onto the space K n . 

In this chapter we show that there is also an isomorphism, determined by the basis 
{e,}, from the algebra A(V) of linear operators on V onto the algebra cA of n-square matrices 
over K. 

A similar result also holds for linear mappings F : V •* TJ, from one space into another. 



MATRIX REPRESENTATION OF A LINEAR OPERATOR 

Let T be a linear operator on a vector space V over a field K and suppose {ei, . . . , e n ) is 
a basis of V. Now T(ei), . . . , T(e n ) are vectors in V and so each is a linear combination of 
the elements of the basis {ei}: 

T(ei) = anei + ai 2 e 2 + • • • + ai n e n 

T(e 2 ) = d2iei + «22e 2 + • • • + a 2 ne„ 



T(e„) = dniei + an 2 e 2 + • • • + a n ne n 
The following definition applies. 

Definition: The transpose of the above matrix of coefficients, denoted by [T] e or [T], is 
called the matrix representation of T relative to the basis {e f } or simply the 
matrix of T in the basis {d}: 



an 


a 2 i 




<ll 2 


d 22 




am 


a 2n 


. . ttnn 



Example 7.1 : Let V be the vector space of polynomials in t over R of degree ^ 3, and let D : V -* V 
be the differential operator denned by D(p(t)) = d(p(t))/dt. We compute the matrix 
of D in the basis {1, t, t 2 , t 3 }. We have: 

D(l) = 0 = 0 + Of + Ot 2 + 0t s 
D(t) = 1 = 1 + Ot + Ot 2 + Ot 3 
D(t 2 ) = 2t = 0 + 2t + Of 2 + Oi 3 
D(t 3 ) = 3t 2 = 0 + Ot + 3t 2 + Ot 3 
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Accordingly, 



ID] = 




Example 7.2: 



Let T be the linear operator on R 2 denned by T(x, y) = (4x — 2y, 2x + y). We com- 
pute the matrix of T in the basis {f t = (1, 1), f 2 = (— 1, 0)}. We have 

T{fi) = 7/(1, 1) = (2, 3) = 3(1, 1) + (-1, 0) = 3A + f 2 

T(f 2 ) = T(-l, 0) = (-4, -2) = -2(1, 1) + 2(-l, 0) = -2/j + 2/ 2 



Accordingly, [2"]/ = ^ ' 



Remark: Recall that any n-square matrix A over X defines a linear operator on X" by 
the map v i-> Av (where v is written as a column vector). We show (Problem 
7.7) that the matrix representation of this operator is precisely the matrix A 
if we use the usual basis of K n . 

Our first theorem tells us that the "action" of an operator IT on a vector v is preserved 
by its matrix representation: 

Theorem 7.1: Let {e u . . ., e n } be a basis of V and let T be any operator on V. Then, for 
any vector v € V, [T] e [v] e = [T(v)] e . 

That is, if we multiply the coordinate vector of v by the matrix representation of T, 
then we obtain the coordinate vector of T{v). 

Example 7.3: Consider the differential operator D : V -+ V in Example 7.1. Let 

p(t) = a+bt + ct 2 + dtf and so D(p(t)) = b + 2ct + 3dt 2 
Hence, relative to the basis {1, *, t 2 , t 3 }, 



W*)] = 



and [D(p(t))} = 



We show that Theorem 7.1 does hold here: 



[D][p(t)] = 



1 0 


1 


0 


o\ 


la\ 






0 


0 


2 


0 






2c 


0 


0 


0 


3 




H 


3d 


\o 


0 


0 


0/ 


w 







= [B(P(«))] 



Example 7.4: Consider the linear operator r : K 2 -» R 2 in Example 7.2: T(x, y) = (4s — 2y, 2x + y). 
Let v = (5, 7). Then 

v = (5,7) = 7(1, 1) + 2(-l, 0) = 7A + 2/ 2 
T(t») = (6,17) = 17(1, 1) + 11(-1, 0) = 17/i + ll/jj 
where f t = (1, 1) and f 2 = (—1, 0). Hence, relative to the basis f 2 }, 



) 



and 



[T(v)] f = 



Using the matrix [T] f in Example 7.2, we verify that Theorem 7.1 holds here: 

™->< =GDG) - (") " ™< 
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Now we have associated a matrix [T] e to each T in A(V), the algebra of linear operators 
on V. By our first theorem the action of an individual operator T is preserved by this 
representation. The next two theorems tell us that the three basic operations with these 
operators 

(i) addition, (ii) scalar multiplication, (iii) composition 
are also preserved. 

Theorem 7.2: Let {e u ...,e»} be a basis of V over K, and let cA be the algebra of 
n-square matrices over K. Then the mapping T h» [T] e is a vector space 
isomorphism from A(V) onto cA. That is, the mapping is one-one and onto 
and, for any S, T G A(V) and any k G K, 

[T + S] e = [T]e+[S] e and [kT] e = k[T] e 

Theorem 7.3: For any operators S, T G A(V), [ST] e = [S] e [T] e . 

We illustrate the above theorems in the ease dim V - 2. Suppose {d, e 2 } is a basis of 
V, and T and S are operators on V for which 

T(ei) = aiei + a 2 e 2 S(ei) — Cid + c 2 e 2 

T(e 2 ) = biei + b 2 e 2 ' S(e 2 ) = diet + d 2 e 2 

m. = £ - «■ - £ Z 

Now we have (T + S)(ei) - T(ei) + S(ei) = aiei + a 2 e 2 + cid + c 2 e 2 

= (a! + Ci)ei + (a 2 + c 2 )e 2 

(T + S)(e 2 ) = T(e 2 ) + S(e 2 ) = Mi + b 2 e 2 + dtfi + d 2 e 2 

= (b 1 + di)ei + (b 2 + d 2 )e 2 

Thus 



= (:;: :;::;) = (:; :;)+(:; t) - ^ 

Also, for kGK, we have 

(kT)(ei) = kT(ei) = k(aiei + a 2 e 2 ) = fcoiei + ka 2 e 2 
(kT)(e 2 ) = kT(e 2 ) = k{bid + b 2 e 2 ) = kbtfi + kb 2 e 2 



I kai kbi\ , fai bi\ r , 

Finally, we have 



(ST)(ei) = S(T(ei)) = S(aiei + a 2 e 2 ) = oiS(ei) + a 2 S(e 2 ) 

= ai(ciei + c 2 e 2 ) + a 2 (diei + d 2 e 2 ) 

= (aiCi + a 2 <Zi)ei + (aic 2 + (12^2)62 

(ST)(e 2 ) = S(T(e 2 )) = S(b iei + b 2 e 2 ) = biS(ei) + b 2 S(e 2 ) 

= &i(ciei + c 2 e 2 ) + b 2 (diei + d 2 e 2 ) 

= (OiCi + & 2 di)ei + (bic 2 + b 2 d 2 )e 2 



Accordingly, 



/aiCi + a 2 di &iCi + & 2 di\ _ /ci dA/di bi\ _ 
lMJe " ^a lC2 + a 2 d 2 + " ^c 2 d 2 / \a 2 62/ L M J ° 
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CHANGE OF BASIS 

We have shown that we can represent vectors by n-tuples (column vectors) and linear 
operators by matrices once we have selected a basis. We ask the following natural question: 
How does our representation change if we select another basis? In order to answer this 
question, we first need a definition. 

Definition: Let {ei, . . .,e n ) be a basis of V and let . . .,/„} be another basis. Suppose 

fi = a n ei + a i2 e 2 + • • • + a in e n 
f% = 02161 + a^afiz + ■ ■ ■ + a 2n e n 



f n - a n iei + a„262 + • • ■ + a nil e„ 

Then the transpose P of the above matrix of coefficients is termed the transi- 
tion matrix from the "old" basis {d} to the "new" basis {/{}: 







a%\ 


. a n \ 


p = 


1 ai2 


a 22 


a n 2 




\ain 


ffi2n 


. . a n n 



We comment that since the vectors fi, . . . , f n are linearly independent, the matrix P is 
invertible (Problem 5.47). In fact, its inverse P~ l is the transition matrix from the basis 
{fi} back to the basis {ei). 



Example 7.5: Consider the following two bases of R?: 

{«! = (1, 0), e 2 = (0, 1)> and {fi = (1, D, h = (-1, 0)} 
Then /, = (1, 1) = (1, 0) + (0, 1) = e t + e 2 

f 2 = (-1,0) = -(1,0) + 0(0,1) = - ei + 0e 2 
Hence the transition matrix P from the basis {ej to the basis {/J is 

We also have e t = (1,0) = 0(1, 1) - (-1, 0) = 0/ x - / 2 

e 2 = (0,1) = (1,1) + (-1,0) = / t + / 2 
Hence the transition matrix Q from the basis back to the basis {e ( } is 

Q = 

Observe that P and Q are inverses: 

n 1\ /1 n\ 

= / 



- (-? i) 



- - c -;)(_: i) - c ') 

We now show how coordinate vectors are affected by a change of basis. 

Theorem 7.4: Let P be the transition matrix from a basis {e { } to a basis {/i} in a vector 
space V. Then, for any vector v e V, P[v] f = [v] e . Hence [v] f = P -1 [v] e . 

We emphasize that even though P is called the transition matrix from the old basis 
{d} to the new basis {/.}, its effect is to transform the coordinates of a vector in the new 
basis {ft} back to the coordinates in the old basis {et}. 



154 



MATRICES AND LINEAR OPERATORS 



[CHAP. 7 



We illustrate the above theorem in the case dim 7 = 3. Suppose P is the transition 
matrix from a basis {e u e 2 , e 3 ) of V to a basis {/i,/ 2 ,/s} of V; say, 

fi = aid + a 2 e 2 + a 3 es ai bi Ci\ 

h = &iei + b 2 e 2 + b 3 e 3 . Hence P = a 2 b% c 2 
/3 = ciei + c 2 e 2 + Cs63 \«3 &3 cs/ 

Now suppose v GV and, say, v = fci/i + fe^ + fos/s. Then, substituting for the f t from 
above, we obtain 

v = fci(aiei + a 2 e 2 + a 3 ea) + fc 2 (&iei + & 2 e 2 + b 3 e 3 ) + k 3 (ciei + c 2 e 2 + C3e 3 ) 
= (aifei + bik 2 + Cik 3 )ei + (a 2 &i + o 2 fc 2 + c 2 fc 3 )e 2 + (a 3 ki + b s k 2 + c 3 k 3 )e 3 

Thus /jgA laiki + b 1 k 2 + cik 3 \ 

[v] f = \ k 2 I and [«], - a 2 fei + & 2 fc 2 + c 2 fc 3 

\a 3 A;i + b 3 fc 2 + c 3 k 3 j 



kl\ 








k 2 


1 




and 


k 3 j 








ai 








0.2 


& 2 


c 2 




a s 


&3 


C 3 





Accordingly, / &j ,, fc i /a,fc, + b,fc, + cifc. 

P[«]y = [ a 2 & 2 c 2 ][ fe 2 ] = a 2 fei + & 2 fc 2 + c 2 fe 3 ) = [i;] f 

\a 3 A;i + 6 3 A; 2 + c 3 k 3 j 

Also, multiplying the above equation by P _1 , we have 

P~ l [v] t = P-'Pf*]/ = /[!>], = [t>]/ 

Example 7.6: Let v = (a, 6) e R 2 . Then, for the bases of R 2 in the preceding example, 

v = (a, 6) = «t(l,0) + 6(0,1) = oe! + 6e 2 
t, = ( tt , 6) = 6(1,1) + (6- a )(-i,0) = 6/j + (6 - a)f 2 

Hence [v], - (j) and [«], = ( 6 ! ff ) 

By the preceding example, the transition matrix P from {e t > to {f t } and its inverse 
p-i are given by 



We verify the result of Theorem 7.4: 



** - c -;)(.!.) - 0 - h. 



The next theorem shows how matrix representations of linear operators are affected 
by a change of basis. 

Theorem 7.5: Let P be the transition matrix from a basis {e t } to a basis {/i} in a vector 
space V. Then for any linear operator T on V, [T], = p-^TJ.P. 

Example 7.7: Let T be the linear operator on R 2 denned by T(x, y) - (4» - 2j/, 2x + y). Then for 
the bases of R 2 in Example 7.5, we have 

T(e x ) = r(l,0) = (4,2) = 4(1, 0) + 2(0, 1) = 4e t + 2e 2 
r(e 2 ) = 7/(0, 1) = (-2,1) = -2(1,0) + (0,1) = -2e t + e 2 



Accordingly, [r]. 



■ (i 1) 
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We compute [T] f using Theorem 7.5: 

m, - r-. nr - -J) - Q -\) 

Note that this agrees with the derivation of [T] f in Example 7.2. 

Remark: Suppose P - (a«) is any tt-square invertible matrix over a field K. Now if 
{d, ■ ■ ■ , e n ) is a basis of a vector space V over K, then the n vectors 

/■ = a u ei + and + • • ■ + a Bi e„, i = 1, . . . , n 

are linearly independent (Problem 5.47) and so form another basis of V. 
Furthermore, P is the transition matrix from the basis {&} to the basis {/{}. 
Accordingly, if A is any matrix representation of a linear operator T on V, 
then the matrix B = P~ l AP is also a matrix representation of T. 

SIMILARITY 

Suppose A and B are square matrices for which there exists an invertible matrix P 
such that B = P~ l AP. Then B is said to be similar to A or is said to be obtained from A 
by a similarity transformation. We show (Problem 7.22) that similarity of matrices is an 
equivalence relation. Thus by Theorem 7.5 and the above remark, we have the following 
basic result. 

Theorem 7.6: Two matrices A and B represent the same linear operator T if and only if 
they are similar to each other. 

That is, all the matrix representations of the linear operator T form an equivalence 
class of similar matrices. 

A linear operator T is said to be diagonalizable if for some basis {d) it is represented 
by a diagonal matrix; the basis is then said to diagonalize T. The preceding theorem 
gives us the following result. 

Theorem 7.7: Let A be a matrix representation of a linear operator T. Then T is 
diagonalizable if and only if there exists an invertible matrix P such that 
P~ X AP is a diagonal matrix. 

That is, T is diagonalizable if and only if its matrix representation can be diagonalized 
by a similarity transformation. 

We emphasize that not every operator is diagonalizable. However, we will show 
(Chapter 10) that every operator T can be represented by certain "standard" matrices 
called its normal or canonical forms. We comment now that that discussion will require 
some theory of fields, polynomials and determinants. 

Now suppose / is a function on square matrices which assigns the same value to similar 
matrices; that is, f(A) = f(B) whenever A is similar to B. Then / induces a function, also 
denoted by /, on linear operators T in the following natural way: f(T) = f([T] e ), where {e { } 
is any basis. The function is well-defined by the preceding theorem. 

The determinant is perhaps the most important example of the above type of functions. 
Another important example follows. 

Example 7.8: The trace of a square matrix A = (a i} ), written tr (A), is defined to be the sum of 
its diagonal elements: 

tr (A) = a n + a 22 + • • • + a„ B 

We show (Problem 7.22) that similar matrices have the same trace. Thus we can 
speak of the trace of a linear operator T; it is the trace of any one of its matrix 
representations: tr (T) = tr ([T] e ). 
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MATRICES AND LINEAR MAPPINGS 

We now consider the general case of linear mappings from one space into another. 
Let V and U be vector spaces over the same field K and, say, dim V = m and dim U = n. 
Furthermore, let {d, ...,e m } and {fi, ...,f n } be arbitrary but fixed bases of V and U 
respectively. 

Suppose F : V -» U is a linear mapping. Then the vectors F (ei), . . . , F(e m ) belong to 
U and so each is a linear combination of the ft 

F(ei) - Oil/l + 0,12/2 + • • • + Oln/n 

F(e 2 ) = a 2 i/i + a.22/2 + • • • + a 2 n/n 



F(e m ) = Omi/i + Omfifi + • • • + Omn/n 

The transpose of the above matrix of coefficients, denoted by [F]* is called the matrix 
representation of F relative to the bases {&} and {fi}, or the matrix of F in the bases {ei} 
and {ft}: 





1 an a2i 


Oml \ 


[F]t = 


0-12 0.22 


a m 2 




\ ain 0-2n 


a mn 1 



The following theorems apply. 

Theorem 7.8: For any vector v G V, [F] f e [v] e = [F(v)] f . 

That is, multiplying the coordinate vector of v in the basis {e 4 } by the matrix [F]*, we 
obtain the coordinate vector of F(v) in the basis {/ { }. 

Theorem 7.9: The mapping F h* [F]* is an isomorphism from Horn (V, U) onto the vector 
space of n x m matrices over K. That is, the mapping is one-one and onto 
and, for any F, G € Horn {V, U) and any k G.K, 

[F + G]* = [F\* + [G\! and [kF]f = k[F]' 

Remark: Recall that any n x m matrix A over if has been identified with the linear map- 
ping from K m into K n given by Av. Now suppose V and 17 are vector 
spaces over K of dimensions m and « respectively, and suppose {ei} is a basis 
of V and {A} is a basis of U. Then in view of the preceding theorem, we shall 
also identify A with the linear mapping F : V -*■ U given by [F{v)] f = A[v] e . We 
comment that if other bases of V and U are given, then A is identified with 
another linear mapping from V into U. 

Theorem 7.10: Let {e{}, {fi} and {gi} be bases of V, U and W respectively. Let F : V ■* U 
and G : U -> W be linear mappings. Then 

[GoF]° e = [G]° f [F]t 

That is, relative to the appropriate bases, the matrix representation of the composition 
of two linear mappings is equal to the product of the matrix representations of the 
individual mappings. 

We lastly show how the matrix representation of a linear mapping F : V -» U is affected 
when new bases are selected. 

Theorem 7.11: Let P be the transition matrix from a basis {e { } to a basis {el} in V, and let 
Q be the transition matrix from a basis {fi} to a basis {//} in U. Then for 
any linear mapping F : V -*■ U, 

in- = Q-nnp 
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Thus in particular, 

[F\* = Q-*[F]* 
i.e. when the change of basis only takes place in U; and 

mi = [F\iP 

i.e. when the change of basis only takes place in V. 

Note that Theorems 7.1, 7.2, 7.3 and 7.5 are special cases of Theorems 7.8, 7.9, 7.10 
and 7.11 respectively. 

The next theorem shows that every linear mapping from one space into another can be 
represented by a very simple matrix. 

Theorem 7.12: Let F:V-*U be linear and, say, rankF = r. Then there exist bases of 
V and of U such that the matrix representation of F has the form 



A = 



I 0 
0 0 



where I is the r-square identity matrix. We call A the normal or canonical 
form of F. 

WARNING 

As noted previously, some texts write the operator symbol T to the right of the vector 
v on which it acts, that is, 

vT instead of T{v) 

In such texts, vectors and operators are represented by «-tuples and matrices which are the 
transposes of those appearing here. That is, if 



then they write 



v = kid + k 2 e 2 + • • • + k n e n 



[v] e = (ki,k 2 , ...,k„) instead of [v] e = 



And if 



then they write 



T(ei) = aid + a 2 e 2 + • ■ • + a„e n 
T(e 2 ) — biei + b 2 e 2 + ■ • ■ + &„e„ 

T(e n ) = Cid + c 2 e 2 + ■ ■ • + c n e n 





Ui 


a 2 


a n \ 






hi 


bi . 


. Ci\ 


[T]e = 


bi 


b 2 


bn 


instead of 


[T]e = 


a 2 


b 2 .. 


c 2 




\Ci 


c 2 . . 


. CnJ 








bn . 


. CnJ 



This is also true for the transition matrix from one basis to another and for matrix rep- 
resentations of linear mappings F : V ■* U. We comment that such texts have theorems 
which are analogous to the ones appearing here. 
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Solved Problems 

MATRIX REPRESENTATIONS OF LINEAR OPERATORS 

7.1. Find the matrix representation of each of the following operators T on R 2 relative to 
the usual basis {ei = (1,0), e 2 = (0,1)}: 

(i) T(x, y) = (2y, Sx - y), (ii) T(x f y) = (Sx -Ay,x + 5y). 

Note first that if (a, 6) G R 2 , then (o, 6) = ae x + be 2 . 

(i) 



T( ei ) = 


r<i,o) 


= (0,3) = 


0e t + 3e 2 


T(e 2 ) = 


T(Q, 1) 


= (2,-1) = 


2e x — « 2 


2>i) = 


ra,o) 


= (3,1) = 


Se t + e 2 


T(e 2 ) = 


T(0, 1) 


= (-4,5) = 


— 4e x + 5e 2 



/o 2 

and ^(J-) 



7.2. Find the matrix representation of each operator T in the preceding problem relative 
to the basis {A = (1,3), f 2 = (2, 5)}. 

We must first find the coordinates of an arbitrary vector (a, b) G R 2 with respect to the basis 
{/J. We have 

(a, b) - x(l, 3) + 1/(2, 5) = (x + 2y, Zx + 5y) 
or x + 2y = a and Sx + 5y = b 

or x = 26 — 5a and j/ = 3a — 6 

Thus (a, 6) = (26 - 5a)A + (3a - 6)/ 2 

(i) We have T(x, y) = (2y, Sx - y). Hence 



T(f{> = r<1.8) = (6,0) = -30/! + 18/ 2 ^ = /-30 -48 s 

T(f 2 ) = T(2,5) = (10,1) = -48/i + 29/ 2 



(ii) We have T(x, y) = (3x — Ay,x + 5y). Hence 

r(/,) = r(l,3) = (-9,16) = 77/ 1 - 43/ 2 
T(f 2 ) = T(2,5) = (-14,27) = 124^ - 69/ 2 



/-30 -48 \ 
\ 18 29/ 

/ 77 124\ 
and ^ = (-43 - 69 ) 



7.3. Suppose that T is the linear operator on R 3 denned by 

T(x, y, z) = (aix + a 2 y + a 3 z, ba + b 2 y + b 3 z, cix + c 2 y + Cgz) 

Show that the matrix of T in the usual basis {e { } is given by 

' ai a 2 d3 \ 

[T]e = 

That is, the rows of [T] e are obtained from the coefficients of x, y and z in the com- 
ponents of T(x, y, z). 

T( ei ) = T(l, 0, 0) = (a x , 6 1( c t ) = + b ie2 + c^g 
T(e 2 ) - T(0, 1,0) = (a 2 , 6 2 , c 2 ) = a 2 e t + b 2 e 2 + c 2 e 3 
T(e 3 ) = T(0, 0, 1) = (a 3 , 6 3 , c 3 ) = + 6 3 e 2 + c s e s 




Accordingly, 



W\e = 




Remark: This property holds for any space K n but only relative to the usual basis 

{e 1 = (l, 0, ...,0), e 2 = (0,1,0, ...,0), e„ = (0, . . ., 0, 1)} 
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7.4. Find the matrix representation of each of the following linear operators T on R 3 
relative to the usual basis {ei = (1, 0, 0), e 2 = (0, 1, 0), e 3 — (0, 0, 1)}: 

(i) T(x,y,z) = (2x - Zy + 4z, 5x - y + 2z, Ax + ly), 

(ii) T{x,y,z) = (2y + z, x - Ay, Zx). 

12 -3 4\ /0 2 l\ 

By Problem 7.3: (i) [T] e = I 5 -1 2 , (ii) [T] e = 1 -4 0 . 

\4 7 0/ \3 0 0/ 



7.5. Let T be the linear operator on R 3 defined by T(x, y, z) = (2y + z, x — Ay, Zx). 

(i) Find the matrix of T in the basis {/i = (1, 1, 1), / 2 = (1, 1, 0), f 3 = (1, 0, 0)} 

(ii) Verify that [T], [v] f = [T{v)] t for any vector v G R 3 . 

We must first find the coordinates of an arbitrary vector (a, 6, c) G R 3 with respect to the basis 
{fitfiifai- Write (a,b,c) as a linear combination of the / { using unknown scalars x, y and z: 

(a, b, c) = x(l, 1, 1) + y(l, 1, 0) + z(l, 0, 0) 
= (x + y + z, x + y, x) 
Set corresponding components equal to each other to obtain the system of equations 

x + y + z = a, x + y = b, x = c 
Solve the system for x, y and z in terms of a, b and c to find x = c, y = b — e, z = a — b. Thus 

(a, 6, c) = c/x + (6 - e)f 2 + (a - b)f s 

(i) Since T(x, y, z) = (2y + z, x - 4y, 3*) 

r(/ t ) = r(l,l,l) = (3,-3,3) = 3/,-6/ 2 + 6/ 3 
T(f 2 ) = 7/(1,1,0) = (2,-3,3) = 3/, -6/ 2 + 5/ 3 and [T] f = 
T(f 3 ) = 7X1,0,0) = (0,1,3) = 3/ x -2/ 2 - f z 

(ii) Suppose v = (a, b, c); then 

v = (a, b, e) = ef v + (6 — c)/ 2 + (o — 6)/ 3 and so [v] f = 




Also, 

T(v) = T(a, b, c) = (26 + c, a - 46, 3a) 

= 3a/! + (-2a - 46)/ 2 + (-a + 66 + c)/ 3 



3a 

and so [T{v)] f = \ -2a - 46 

[—a + 66 + c! 



Thus / 3 3 3 < 



e 




[T}f[v], = |-6-6-2 U-c I = | -2a -46 | = [T(v)] f 
6 5 -1/ \a- b) 

7.6. Let A = ^ g ^ and let T be the linear operator on R 2 defined by T(v) = Av (where 

v is written as a column vector). Find the matrix of T in each of the following bases: 

(i) {ei = (1, 0), e 2 = (0, 1)}, i.e. the usual basis; 

(ii) {/i = (l,3), / 2 = (2,5)}. 

(i) ne ^ ( \ \)Q=(\) = ^ + * e > ... ._. n 2 



and thus [T] e = 



3 4 
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Observe that the matrix of T in the usual basis is precisely the original matrix A which 
denned T. This is not unusual. In fact, we show in the next problem that this is true for any 
matrix A when using the usual basis. 

(ii) By Problem 7.2, (a,b) = (2b-5a)f 1 + (3a-b)f 2 . Hence 

™-GX)-(i)--* + * 

and thus [T] f = ( ) 
/l 2\/2\ /12\ \ 6 10 / 



7.7. Recall that any w-square matrix A = (a«) may be viewed as the linear operator T on 
K n denned by T(v) = Av, where v is written as a column vector. Show that the 
matrix representation of T relative to the usual basis {et} of K n is the matrix A, that 
is, [T] e = A. 

' 1 
0 



r(e t ) = Ae t = 



T(e 2 ) - Ae 2 = 




\ a nl a n2 





/ 9 "\ 














)-l 






\««2/ 



= One! + a 21 e 2 + 



+ a nl e„ 



= Oi 2 ei + o 22 e 2 + — + o n2 e„ 





/ «11 °12 • • • 


<*ln \ 


M 




Am 1 




T(e n ) = Ae n = 


0 21 Ojj ... 




0 


H 


«2n 


= om^i + o 2n e 2 + • • • + o Bn e n 




\ a nl a n2 • • • 


a nn/ 






\«nn/ 




(That is, T(e$ = Ae t is the ith column of A.) Accordingly, 








' a ll 


«12 




a ln \ 






m. = 1 


°21 


a 22 




«2n 


= A 






\«nl 


a n2 




°nn / 





7.8. Each of the sets (i) {l,t,e*,te*} and (ii) {e 3t , *e 3t , £ z e st } is a basis of a vector space V 
of functions / : R -» R. Let Z> be the differential operator on V, that is, D(f) — df/dt. 
Find the matrix of D in the given basis. 



and [D] = 



(i) 0(1) = 0 = 0(1) + 0(t) + 0(e*) + O(te') 
Z>(t) = 1 = 1(1) + 0(t) + 0(e*) + 0(te*) 
£>( e t) = e » = 0(1) + 0(t) + l(e*) + 0(te*) 
D(te*) = e* + te* = 0(1) + 0(t) + l(e«) + l(te*) 

(ii) 2>(e«) = 3e»' = 3(68*) + 0(te») + 0{t^*) 
D(t&t) = e3t + 3«e3t = l(eS') + 3(«eS0 + 0(t 2 e 3t ) and [Z>] = 
D(t2 e 3t) = 2te" + 3*2e« = 0(e«) + 2(46") + 3(tV») 
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7.9. Prove Theorem 7.1: Suppose {e u ...,e n } is a basis of V and T is a linear operator 
on V. Then for any vGV, [T] e [v] e = [T(v)] e . 

Suppose, for i — 1, . . . , n, 

n 

T(ej) = a il e 1 + a a e 2 + • • • + a ln e n = 2 % e j 

)=i 

Then [T] e is the n-square matrix whose ith row is 

( a lj> a 2j> ■••»«nj) CO 
n 

Now suppose v = k 1 e 1 + k 2 e 2 + • • • + k n e n = 2 K e i 

Writing a column vector as the transpose of a row vector, 

Me = (fci,A; 2 , ...,k„)t (2) 

Furthermore, using the linearity of T, 

T(v) = T^JUifii) = jgfc^ej) = J[ * (,5 

n / n \ n 

Thus is the column vector whose ith entry is 

ay*! + a^fc; + • • • + a nj k„ (S) 

On the other hand, the jth entry of [r] e [f] e is obtained by multiplying the jth row of [T] e by [v] e , 
i.e. CO by (2). But the product of CO and (2) is (#); hence [r] e [i>] e and [T(v)] e have the same entries. 
Thus [T] e [v] e = [T(v)) e . 

7.10. Prove Theorem 7.2: Let {e u . . . , e n ) be a basis of V over K, and let be the algebra 
of n-square matrices over K. Then the mapping T >-> [T] e is a vector space isomor- 
phism from A(V) onto oA. That is, the mapping is one-one and onto and, for any 
S, T G A(V) and any JcGK, [T + S] e = [T] e + [S] e and [JcT] e = k[T] e . 

The mapping is one-one since, by Theorem 8.1, a linear mapping is completely determined by 
its values on a basis. The mapping is onto since each matrix M G a4 is the image of the linear 
operator n 

F( e i) = 2 m>\i «i * = 1» ■ • • i « 

3 = 1 

where (m i3 ) is the transpose of the matrix M. 
Now suppose, for i = 1, . . . , to, 

n n 

T(ei) = 2 o^ifj and S(e t ) = ^2 

Let A and B be the matrices A = (oy) and i? = (b v ). Then [2"] e = A* and [S] e = B*. We have, 
for i = 1, . . .,to, „ 

(r + S)( ei ) = r(e 4 ) + S(e f ) = 2K + »«)«j 

Observe that A + J? is the matrix (o y + 6 {j ). Accordingly, 

[T + S] e = {A + B) f = A* + B* = [J1.+ [S]. 

We also have, for i = 1 n, 

(kT)( ei ) = k T(e t ) = k ^2 = 2 (fc^y)^ 

Observe that fcA is the matrix (ka tj ). Accordingly, 

[kT] e = (fcA)t = kAt = k[T\ e 

Thus the theorem is proved. 
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7.11. Prove Theorem 7.3: Let {ei, . . . , e»} be a basis of V. Then for any linear operators 
S, T G A{V), [ST] e = [S] e [T] e . 

n n 

Suppose T(eJ = 2 «h«j and S(e.) = 2 &jic e if Let A and B be the matrices A = (ay) and 

j=i fc=l 

f? = (5 jfc ). Then [T] e = A* and = B t . We have 

(SD(«j) = S(r(e s )) = sf 2 ««0 = 2 %S(e } ) 

\j— i / J-i 

n , » \ n , n \ 

= 2 «y ( 2 b jk e k = 2(2 a«6j)c ) «k 



Recall that AB is the matrix AB - (c ifc ) where = ^2 a«&jk- Accordingly, 

[ST] e = (AB)' = B*A* = [S] e [T) e 



CHANGE OF BASIS, SIMILAR MATRICES 

7.12. Consider these bases of R 2 : {et = (1,0), e 2 = (0,1)} and {A = (1,3), f 2 = (2,5)}. 
(i) Find the transition matrix P from {e«} to {A}, (ii) Find the transition matrix Q 
from {A} to {e*}. (iii) Verify that Q = P~K (iv) Show that [v] f = P- 1 [v]e for any 
vector v € R 2 . (v) Show that [T^P-^TLJ* for the operator T on R 2 defined by 
T(x, y) = (2y, 3x - y). (See Problems 7.1 and 7.2.) 



(i) A = d> 3 ) = l«i + 3e 2 p = /I 2 

/ a = (2,5) = 2e x + 5e 2 



and P = (* J) 



(ii) By Problem 7.2, (a, 6) = (2b - 5a) A + (3a - 6)/ 2 . Thus 

«! = (1,0) = -5/, + 3/ 2 ^ _ ^-5 



- (1 ->) 



e 2 = (0,1) = 2/ x - / 2 

W PQ = (a -l) = (5 J) = 1 

/a\ /2b — 5a\ 

(iv) If v = (a,b), then M. = [v] 1 = { 3a - b ) ' 6 

-m. - (i _?)© - (i:_ + r) - m, 

(v) By Problems 7.1 and 7.2; [T] e = and [fy = ( J)" HenCe 

- (1 4)(S -X I) - (12 U) - ■» 



7.13. Consider the following bases of R 3 : {ei = (1,0,0), e 2 = (0,1,0), e 3 = (0,0,1)} and 
{A = (1,1,1), A = (1,1,0). / 3 = (1,0,0)}. (i) Find the transition matrix P from {e,} 
to {/i}. (ii) Find the transition matrix Q from {A} to {e s }. (iii) Verify that Q = P \ 
(iv) Show that [v] f = P _1 [v] e for any vector v G R 3 . (v) Show that [T] f = P '[TJeP 
for the T defined by T(x, y, z) = (2y + z,x- Ay, 3a;). (See Problems 7 .4 and 7.5.) 

(i) A = (1,1,1) = le x + le2+l«3 

/ 2 = (1,1,0) = lei + lea + Oej 
h = (1,0,0) = let + Oejj + Oes 



and 



P = 
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(ii) By Problem 7.5, (a, 6, c) = ef l + (b- c)f 2 + (a- b)f s . Thus 

(1,0,0) = O/j + O/a+1/3 



ej = 

e 2 = (0,1,0) = O/i + l/2-l/g 
e 3 = (0,0,1) = 1A-1/2 + 0/3 



and 







ll 


1 




1° 


0 






(iii) 


PQ = 




1 


:) 


0 


1 




H 








0 




W 


-1 








(iv) If v = (a, b, c), then [v] e = 



(v) By Problems 7.4(ii) and 7.5, [T] e 



Thus 




M/ 



0 2 1 

1-4 0 ] and [r] / 
,3 0 0, 





/o 


0 




/ 0 2 






1 


p-itn.p = 


0 


1 


-ill 


1 -4 


:) 


(i 


1 






-1 




k 3 0 






0 




Thus 



7.14. Prove Theorem 7.4: Let P be the transition matrix from a basis {e 4 } to a basis {/ { } 
in a vector space V. Then for any v G V, P[v] f = [v] e . Also, [v] f = P" 1 ^],. 

n 

Suppose, for i = 1 n, / 4 = o iiei + a i2 e 2 + • • • + a in e n = 2 a ijej . Then P is the «-square 

matrix whose jth row is * =1 

«2/, • • •» «nj) (J) 
n 

Also suppose v = k 1 f 1 + kj 2 + + kj„ = 2 Then writing a column vector as the 

transpose of a row vector, 1=1 

[v] f = ...,*;„)« ( 2 ) 

Substituting for / 4 in the equation for v, 

v = 2v< = |^(|^) = 2(2*^ 

n 

= } 2 (ayfei + a 2j fc 2 + • • • + a nj fc n )e,- 

Accordingly, [v] e is the column vector whose jth entry is 

<*lA + «2j fc 2 + • • • + a„jfe„ (5) 

On the other hand, the ;th entry of P[v] f is obtained by multiplying the ith row of P by [vh, i.e. 
(1) by (2). But the product of (1) and (2) is (8); hence P[v], and [v] e have the same entries and thus 

PM/ = Me- 

Furthermore, multiplying the above by P~» gives P -1 [v] e = P~ 1 P[v] / = [v],. 



7.15. Prove Theorem 7.5: Let P be the transition matrix from a basis {e { } to a basis {/i} in 
a vector space V. Then, for any linear operator T on V, [T], = P" 1 [7/] e P. 

For any vector P-M^M/ = P^HeMe = P- 1 !^)], = [TW],. 
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But [T\ f [v], = [T(v)] t ; hence P^[T\f[v] f = [T] f [v] f . 

Since the mapping v h* [v] f is onto K», P~i[T\ e PX = [T] f X for every X £ K n . 
Accordingly, P~ 1 [T] e P = [T\ t . 

7.16. Show that similarity of matrices is an equivalence relation, that is: (i) A is similar 
to A; (ii) if A is similar to B, then B is similar to A; (iii) if A is similar to B and B is 
similar to C then A is similar to C. 

(i) The identity matrix / is invertible and / = Since A = I~ 1 AI, A is similar to A. 

(ii) Since A is similar to B there exists an invertible matrix P such that A - P _1 BP. Hence 
B = PAP-i = (P-^-iAP-i and P" 1 is invertible. Thus B is similar to A. 

(iii) Since A is similar to B there exists an invertible matrix P such that A = p-^BP, and since 
B is similar to C there exists an invertible matrix Q such that B = Q-^CQ. Hence A = 
p-iBP = P~ 1 (Q- 1 CQ)P - (QP^iCiQP) and QP is invertible. Thus A is similar to C. 

TRACE 

7.17. The trace of a square matrix A = (a«), written tr (A), is the sum of its diagonal 
elements: tr (A) = a n + a 22 + ■ ■ • + cun. Show that (i) tr (AB) = tr (BA), (ii) if A 
is similar to B then tr (A) = tr (B). 

(i) Suppose A = (ay) and B = (b {j ). Then AB - (c ik ) where c ifc = 2 "lAfe- Thus 

tr(AB) = 2% = 2 2 
i=i i=i j=i 

n 

On the other hand, BA = (d ifc ) where d jk = 2 Thus 

tr (BA) = 2 d j} =22 M« =22 CH^i = tr (AB) 
j=i j=i i=i i=i j=i 

(ii) If A is similar to B, there exists an invertible matrix P such that A = p-^BP. Using (i), 

tr(A) = tr (P-iRP) = tr (BPP" 1 ) = tr (B) 

7.18. Find the trace of the following operator on R 3 : 

T(x, y, z) = {aix + a 2 y + a 3 z, bix + b z y + b s z, cix + c 2 y + c 3 z) 

We first must find a matrix representation of T. Choosing the usual basis {ej, 

I a i a 2 a s \ 

h b 2 b 3 

Cl c 2 C3 / 

and tr (T) = tr ([T] e ) = a t + b 2 + c 3 . 

7.19. Let V be the space of 2 x 2 matrices over R, and let M = ^3 ^ • Let T be the linear 
operator on V denned by T(A) = MA. Find the trace of T. 

We must first find a matrix representation of T. Choose the usual basis of V: 

{-.-(::)•*-(: :)■-.■(::)■«•=(::)} 
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Then 





= ME t 


- (I X 


T(E 2 ) 


= ME 2 


- (i X 


T(E 3 ) 


= ME 3 


- (1 X 


T(E 4 ) 


= ME i 


- (1 X 



0 !) 

c :) 



\E l + 0E 2 + SE 3 + 0E 4 
0E t + 1E 2 + 0E 3 + 3S 4 
2E X + 0E 2 + 4E 3 + 0E 4 
OEi + 2E 2 + 0E 3 + 4E t 




Hence 

[T]e = 

and tr (T) = 1 + 1 + 4 + 4 = 10. 



MATRIX REPRESENTATIONS OF LINEAR MAPPINGS 

7.20. Let F : R 3 R 2 be the linear mapping denned by F(x, y, z) = (3s + 2y - 4z, x - 5y + 3z). 

(i) Find the matrix of F in the following bases of R 3 and R 2 : 

{A = (1, 1, 1), U = (1, 1, 0), f 3 = (1, 0, 0)}, {fir, = (1, 3), g 2 = (2, 5)} 

(ii) Verify that the action of F is preserved by its matrix representation; that is, for 
any v € R 3 , [F] B f [v] f = [Fty)],. 

(i) By Problem 7.2, (a, 6) = (26 - 5a)g l + (3a - b)g 2 . Hence 

E(fi) = F(l,l,l) = (1,-1) = + Ag 2 

4 19 s) 

F(f 3 ) = F(1,0,0) = (3,1) = -13^ + 8fif 2 

(ii) If v = (», y, z) then, by Problem 7.5, v - zf 1 + (y- z)f 2 + (as - y)f 3 . Also, 

F(v) = (3x + 2y-4z,x-5y + 3z) = (-13* - 20y + 26z) gi + (8a + Uy - lhz)g 2 
z 



Hence M/ = ^"*J -d [F(v)] g = ( £* + ) . Thus 

r _ Bri /-7 -33 -13\/ * \ /-13* - 20j, + 26*\ 

11? W' = ( 4 19 8 ;|«J-»j = ( + 11, -15, ) = 



7.21. Let F :K n -*K m be the linear mapping defined by 

F(xi, x 2 , . . ., x n ) = (auXi + ■ • • + amXn, aziXi + • • • + a 2n x n , . . . , OmiXi + ■ ■ • + a mn x n ) 

Show that the matrix representation of F relative to the usual bases of K n and of K m 
is given by 

I Oil dl2 . . . (tin \ 

a 2 i 022 . . . a,2n \ 



[F] = 



\dml 0>m2 . . . CLm 
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That is, the rows of [F] are obtained from the coefficients of the x t in the components 
of F(xi, . . . , x n ), respectively. 



F(l, 0, . 


..,0) 


— («n> «21> • • •• a ml) 








«12 • 


• a ln 


F(0, 1, . 


..,0) 


= (a 12 , Cl 22 , • • • > a m2) 


and 


[1 = 




«22 


■ a 2n 


mo, . 


.,D 


— (®ln> a 2n> • • • > O 








a m2 . 


a mn 



7.22. Find the matrix representation of each of the following linear mappings relative to 
the usual bases of R n : 

(i) F : R 2 -» R 3 defined by F(x, y) = (Sx -y,2x + 4y, 5x - 6y) 

(ii) F : R 4 -» R 2 defined by F(x, y, s, t) = (3x - 4y + 2s - 5t, 5x + 7y - s - 2t) 

(iii) F : R 3 -» R 4 defined by F(x, y, z) = (2x + Sy - 8z, x + y + z, 4x - 5z, 6y) 

By Problem 7.21, we need only look at the coefficients of the unknowns in F(x, y, . . .). Thus 

1 % /2 3 -8\ 

(i) [F] = 2 4 (ii) [F] = (J J J I2) [F] = 




7.23. Let T:R 2 -R 2 be defined by T(x,y) = {2x-Sy,x + 4y). Find the matrix of T in 
the bases {ei = (1, 0), e 2 = (0, 1)} and {A = (1,3), /, = (2, 5)} of R 2 respectively. 
(We can view T as a linear mapping from one space into another, each having its 
own basis.) 

By Problem 7.2, (a, 6) = (26 - 5a)/i + (3a - 6)/ 2 . Then 



r(«i) = ra,o) = (2,1) = -8A+ 5/ 2 
r(e 2 ) = 7X0, D = (-3,4) = 23/1-13/2 



7.24. Let A = ^ 4 7 j. Recall that A determines a linear mapping F:R 3 -»R 2 de- 
fined by F(v) = Av where v is written as a column vector. 

(i) Show that the matrix representation of F relative to the usual basis of R 3 and 
of R 2 is the matrix A itself: [F] = A. 

(ii) Find the matrix representation of F relative to the following bases of R 3 and R 2 . 

{/1 = (1, 1, 1), h = (L 1, 0), /, = (1, 0, 0)}, {91 = (1, 3), g 2 = (2, 5)} 



(i) 



from which [F] 




(ii) By Problem 7.2, (a, 6) = (26 - 5a)gi + (3a - b)g 2 . Then 
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7.25. Prove Theorem 7.12: Let F:V-*U be linear. Then there exists a basis of V and a 

basis of U such that the matrix representation A of F has the form A = 
where / is the r-square identity matrix and r is the rank of F. 

Suppose dim V = m and dim U = n. Let W be the kernel of F and V the image of F. We 
are given that rank F — r; hence the dimension of the kernel of F is m — r. Let {w u . . .,w m _ r } 
be a basis of the kernel of F and extend this to a basis of V: 

{«!, . .., V r , W lt ...,M> m _ r } 

Set u l = F(v l ), u 2 = F(v 2 ), u r = F(v r ) 

We note that {u lt . . . , u r } is a basis of I/', the image of F. Extend this to a basis 

{«!, . . ., M r , M r+1 , . . ., lt n } 

1% + 0m 2 + • • • + 0w r + 0u r+1 + ••• + 0u n 
0% + lw 2 + • • • + 0u r + 0u r+1 + ■■• + 0u n 

F(v r ) = u T = 0u t + 0w 2 + • • • + lw r + 0w r+1 + • • • + 0u n 
F(Wj) =0 = 0w x + 0w 2 + • • ' + 0u r + 0u r+1 + •■• + 0u n 

F(w m _ r ) = 0 = 0% + 0w 2 + • • • + 0u r + 0u T+1 + ■■• + 0u n 
Thus the matrix of F in the above bases has the required form. 



of U. Observe that 

F( Vl ) = M t = 
F(v 2 ) = M 2 = 



Supplementary Problems 

MATRIX REPRESENTATIONS OF LINEAR OPERATORS 

7.26. Find the matrix of each of the following linear operators T on R 2 with respect to the usual basis 
{e t = (1, 0), e 2 = (0, 1)}: (i) T(x, y) = (2s - 3y, x + y), (ii) T(x, y) = (5* + y, 3* - 2y). 



7.27. Find the matrix of each operator T in the preceding problem with respect to the basis {/ t = (1, 2), 
f 2 = (2, 3)}. In each case, verify that [T\ f [v] f = [T(v)] f for any 



7.28. Find the matrix of each operator T in Problem 7.26 in the basis {g x = (1, 3), g 2 = (1, 4)}. 
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7.29. Find the matrix representation of each of the following linear operators T on R 3 relative to the 
usual basis: 

(i) T(x, y, z) = (x, y, 0) 

(ii) T(x,y,z) = (2x - 1y - 4z, 3x + y + 4z, 6a - 8y + z) 

(iii) T(x, y,z) = (z, y + z, x + y + z) 

7.30. Let D be the differential operator, i.e. D(f) — df/dt. Each of the following sets is a basis of a 
vector space V of functions / : R -» R. Find the matrix of D in each basis: (i) {«', e 2 *, te 2 '}, 
(ii) {sin t, cos *}, (iii) {e 5t , *e 5t , t 2 e 5 '}, (iv) {1, t, sin St, cos 3t>. 

7.31. Consider the complex field C as a vector space over the real field R. Let T be the conjugation 
operator on C, i.e. T(z) = z. Find the matrix of T in each basis: (i) {1, i), (ii) {1 + i, 1 + 2i). 



7.32. Let V be the vector space of 2 X 2 matrices over R and let M — 




. Find the matrix of each 



of the following linear operators T on V in the usual basis (see Problem 7.19) of V: (i) T(A) = MA, 
(ii) T(A) = AM, (iii) T(A) = MA — AM. 

7.33. Let l v and 0 V denote the identity and zero operators, respectively, on a vector space V. Show that, 
for any basis {ej of V, (i) [l v ] e = I, the identity matrix, (ii) [0 v ] e = 0, the zero matrix. 

CHANGE OF BASIS, SIMILAR MATRICES 

7.34. Consider the following bases of R 2 : {e t = (1, 0), e 2 = (0, 1)} and {/ x = (1, 2), / 2 = (2, 3)}. 

(i) Find the transition matrices P and Q from {ej to {/J and from {/ { } to {ej, respectively. 
Verify Q = P~K 

(ii) Show that [v] e = P[v] t for any vector v S R 2 . 

(iii) Show that [T] f - p-i [T] e P for each operator 7 1 in Problem 7.26. 

7.35. Repeat Problem 7.34 for the bases {f x = (1,2), f 2 = (2,3)} and {g t = (1,3), g 2 = (1,4)}. 

7.36. Suppose {e 1( e 2 } is a basis of V and T : V -» V is the linear operator for which T(e{) = 3e x — 2e 2 
and T(e 2 ) = e x + 4e 2 . Suppose {/i,/ 2 } is the basis of V for which / t = e t + e 2 and / 2 = 2e t + 3e 2 . 
Find the matrix of T in the basis {f x , / 2 }. 

7.37. Consider the bases B = {1, i} and B' = {1 + i, 1 + 2i} of the complex field C over the real field 
R. (i) Find the transition matrices P and Q from B to B' and from B' to B, respectively. Verify 
that Q =P~ 1 . (ii) Show that [T] B - =P- 1 [7 T ] B P for the conjugation operator T in Problem 7.31. 

7.38. Suppose {e { }, {/ 4 } and {flrj are bases of V, and that P and Q are the transition matrices from {e 4 } 
to {/J and from {/J to {ffj, respectively. Show that PQ is the transition matrix from {e{i to {jTj}. 

7.39. Let A be a 2 by 2 matrix such that only A is similar to itself. Show that A has the form 



Generalize tonXti matrices. 

7.40. Show that all the matrices similar to an invertible matrix are invertible. More generally, show that 
similar matrices have the same rank. 

MATRIX REPRESENTATIONS OF LINEAR MAPPINGS 

7.41. Find the matrix representation of the linear mappings relative to the usual bases for R n : 

(i) F : R 3 -> R 2 defined by F(x, y, z) = (2s - 4y + 9z, 5x + Sy- 2z) 

(ii) F : R 2 -» R* defined by F{x, y) = (3a + iy, 5x -2y,x + ly, 4sc) 

(iii) F : R* -» R defined by F(x, y, s,t) = 2x + Sy-7s-t 

(iv) PiR^R 2 defined by F(x) = (3«, 5x) 



A 
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7.42. Let F : R 3 -» R 2 be the linear mapping- defined by F(x, y, z) = (2x + y — z, Sx — 2y + 4z). 

(i) Find the matrix of F in the following bases of R 3 and R 2 : 

{/! = (1, 1, 1), f 2 = (1, 1, 0), f 3 = (1, 0, 0)} and { 9l = (1, 3), g 2 = (1, 4)} 

(ii) Verify that, for any vector v G R 3 , [F] B f [v] t = [F(v)] g . 

7.43. Let {ej and {/J be bases of V, and let l v be the identity mapping on V. Show that the matrix of 
l v in the bases {ej and {/ ; } is the inverse of the transition matrix P from {e ; } to {/ f }; that is, 

7.44. Prove Theorem 7.7, page 155. (Hint. See Problem 7.9, page 161.) 

7.45. Prove Theorem 7.8. (Hint. See Problem 7.10.) 

7.46. Prove Theorem 7.9. (Hint. See Problem 7.11.) 

7.47. Prove Theorem 7.10. (Hint. See Problem 7.15.) 

MISCELLANEOUS PROBLEMS 

7.48. Let T be a linear operator on V and let W be a subspace of V invariant under T, that is, 

(A B\ 

T(W) C W. Suppose dim W — to. Show that T has a matrix representation of the form f ^ 1 
where A is an to X to submatrix. * 

7.49. Let V = U © W, and let 1/ and W each be invariant under a linear operator 2 1 : V -> V. Suppose 

/A 0\ 

dim 17 = to and dim V = n. Show that T has a matrix representation of the form / ) where 

A and f? are to x to and n X n submatrices, respectively. 1 



7.50. Recall that two linear operators F and G on V are said to be similar if there exists an invertible 
operator T on V such that G = T-^FT. 

(i) Show that linear operators F and G are similar if and only if, for any basis {ej of V, the 
matrix representations [F] e and [G] e are similar matrices. 

(ii) Show that if an operator F is diagonalizable, then any similar operator G is also diagonalizable. 

7.51. Two to X n matrices A and B over K are said to be equivalent if there exists an TO-square invertible 
matrix Q and an w-square invertible matrix P such that B — QAP. 

(i) Show that equivalence of matrices is an equivalence relation. 

(ii) Show that A and B can be matrix representations of the same linear operator F : V -> U if 
and only if A and B are equivalent. 

(iii) Show that every matrix A is equivalent to a matrix of the form ( ° J where / is the r-square 
identity matrix and r = rank A. \ " ' 

7.52. Two algebras A and B over a field K are said to be isomorphic (as algebras) if there exists a bijective 
mapping / : A -* B such that for w, v € A and k G K, (i) /(w + d) = /(it) + f(v), (ii) /(few) = kf(u), 
(iii) /(wi>) = f(u)f(v). (That is, / preserves the three operations of an algebra: vector addition, scalar 
multiplication, and vector multiplication.) The mapping / is then called an isomorphism of A onto 
B. Show that the relation of algebra isomorphism is an equivalence relation. 



7.53. 



Let a4 be the algebra of w-square matrices over K, and let P be an invertible matrix in cA. Show 
that the map A h>P _1 AP, where A G cA, is an algebra isomorphism of oA onto itself. 
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7.26. 



Answers to Supplementary Problems 

» (I ■) 



7.27. Here (a,b) = (26 - 3a)/ x + (2a- b)f 2 



/-32 -45 \ / 35 41 \ 

7.28. Here (a, b) = (4a - 6) Pl + (6 - 3a)g 2 . (i) { ^ 35 j (ii) (^ _ 27 _ 32 j 



10 0 
7,29, (i) | 0 1 0 

,0 0 0, 



1 0 0' 
7,39, (i) | 0 2 1 

,0 0 2, 



(ii) 



I 2 -7 -4 s 
3 14 

\e -8 i/ 



« (° 1) 



(iii) 



18 


25 


•11 


-15 


-32 


-45 


25 


35 


0 




1 


1) 


1 




o\ 




2 




5/ 





-23 -39 
15 26 



(iv) 




7.32. (i) 





0 


6 




a 


0 




0 


d 




c 


0 



6 
0 




f * p = (1 !)• « = ( 2 -!) 
« -U 3 4). « = (J4) 

- (J ") 



7.37. P = 



i r 

1 2/' 



/2 -4 9\ 
7 - 41 - W ( 6 3- 2 ) 

/ 3 11 5\ 
[-1 -8 -3j 




(iii) (2,3,-7,-1) (iv) 



7.42. (i) 



Chapter 8 



Determinants 

INTRODUCTION 

To every square matrix A over a field K there is assigned a specific scalar called the 
determinant of A; it is usually denoted by 

det(A) or |A| 

This determinant function was first discovered in the investigation of systems of linear 
equations. We shall see in the succeeding chapters that the determinant is an indispensable 
tool in investigating and obtaining properties of a linear operator. 

We comment that the definition of the determinant and most of its properties also apply 
in the case where the entries of a matrix come from a ring (see Appendix B). 

We shall begin the chapter with a discussion of permutations, which is necessary for 
the definition of the determinant. 



PERMUTATIONS 

A one-to-one mapping o- of the set {1,2, . . .,«} onto itself is called a permutation. We 
denote the permutation <r by 

/l 2 ... n \ . . . , 

a = | . . . or v = jij2 ... Jn, where ji = <r(t) 

\Jl J2 ... Jn) 

Observe that since a is one-to-one and onto, the sequence j\ j\ . . . j n is simply a rearrange- 
ment of the numbers 1,2, . . .,». We remark that the number of such permutations is n\, 
and that the set of them is usually denoted by S„. We also remark that if o- G S n , then the 
inverse mapping <r _1 € S„; and if o-,tGS„, then the composition mapping <to t gS„. In 
particular, the identity mapping 

belongs to S n . (In fact, e = 1 2 . . . n.) 

Example 8.1: There are 2! = 2*1 = 2 permutations in S 2 : 12 and 21. 

Example 8.2: There are 3! = 3-2«l = 6 permutations in S 3 : 123, 132, 213, 231, 312, 321. 

Consider an arbitrary permutation a in S n . <r = jij' z . . . j' n . We say a is even or odd 
according as to whether there is an even or odd number of pairs (i, k) for which 

i > k but i precedes k in a (*) 

We then define the sign or parity of q, written sgn <r, by 

f 1 if a is even 
sgno- = \ 

—1 if a is odd 
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Example 8.3 : Consider the permutation a — 35142 in S 5 . 

3 and 5 precede and are greater than 1; hence (3, 1) and (5, 1) satisfy (*). 

3, 5 and 4 precede and are greater than 2; hence (3, 2), (5, 2) and (4, 2) satisfy (*). 

5 precedes and is greater than 4; hence (5, 4) satisfies (*). 

Since exactly six pairs satisfy (*), a is even and sgn a = 1. 

Example 8.4: The identity permutation e = 12...n is even since no pair can satisfy (*). 

Example 8.5 : In S 2 , 12 is even, and 21 is odd. 

In S 3 , 123, 231 and 312 are even, and 132, 213 and 321 are odd. 

Example 8.6: Let r he the permutation which interchanges two numbers i and ; and leaves the 
other numbers fixed: 

r(i) = j, r(j) = i, r(k) = k, k ¥= i, j 
We call t a transposition. If i < j, then 

t = 12 ... (t-l)j(i+l) ... (j-l)i(j+l) ... n 
There are 2{j — i — 1) + 1 pairs satisfying (*): 

(j, i), U, x), *)> where x = . . . , j— 1 
Thus the transposition t is odd. 



DETERMINANT 

Let A — (an) be an »-square matrix over a field K: 







0fl2 


<Xln 


A = 


a 2 i 




d2n 




\(tnl 


«n2 


Ctnn 



Consider a product of n elements of A such that one and only one element comes from each 
row and one and only one element comes from each column. Such a product can be written 
in the form 

<tij l a 2 j 2 . . . dnj n 

that is, where the factors come from successive rows and so the first subscripts are in the 
natural order 1, 2, . . . , n. Now since the factors come from different columns, the sequence 
of second subscripts form a permutation o- = ji j z . . . jn in S n . Conversely, each permuta- 
tion in S n determines a product of the above form. Thus the matrix A contains n\ such 
products. 

Definition: The determinant of the ^-square matrix A = (ay), denoted by det(A) or \A\, 
is the following sum which is summed over all permutations a = ]\ U • • ■ U 

in S n : _ 

1^1 = 2, (sg n ^)ai h di h . . . dn in 

cr 

That is, \A\ = 2 (sgn «r)aio-(i) 02^2) • • • QWn) 

<r£S„ 

The determinant of the w-square matrix A is said to be of order n and is frequently 
denoted by 

ttll ttl2 . . . din 
0-21 fl-22 . . • din 



dnl dn2 . . . dnn 
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We emphasize that a square array of scalars enclosed by straight lines is not a matrix but 
rather the scalar that the determinant assigns to the matrix formed by the array of scalars. 

Example 8.7: The determinant of a 1X1 matrix A — (a n ) is the scalar on itself: \A\ = o n . 
(We note that the one permutation in S t is even.) 



Example 8.8: In S 2 , the permutation 12 is even and the permutation 21 is odd. Hence 

a ll a 22 ~ a 12 a 21 



Example 8.9: 



a 21 a 22 



Thus 



4 -5 
-1 -2 



= 4(-2) - (-5)(-l) = -13 and 



a b 

e d 



= ad — be. 



In S 3 , the permutations 123, 231 and 312 are even, and the permutations 321, 213 and 
132 are odd. Hence 



°U a 12 a 13 
<I 21 022 «23 
&31 «32 ffl 3 3 



a ll a 22 a 33 a 12 ffl 23 a '31 a 13 a 21°S2 

— aj3a 2 2 a 31 ~ <*12 a 21 a 33 — a ll a 23 a 32 



This may be written as: 

a ll( a 22 a 33 ~ a 23 a 32) — a 12( a 21 a 33 — °23 a 3l) + a 13( a 21 a 32 — a 22 a 3l) 



°11 



which is a linear combination of three determinants of order two whose coefficients 
(with alternating signs) form the first row of the given matrix. Note that each 
2X2 matrix can be obtained by deleting, in the original matrix, the row and column 
containing its coefficient: 



°22 a 23 


— «12 


«21 


a 23 


+ a lS 


a 21 


°22 






<*32 a 33 


<*31 


°33 


«3l 


a 32 



a it *I2 *is 

»2J a 22 «23 

32 a 33 



- a 12 



a n Oja o ls 

a 2 i «22 «23 

«S1 *«! "33 



+ °13 



»u «hs «is 

«21 °22 «23 
031 °32 



Example 8.10: (i) 



2 3 4 
5 6 7 
8 9 1 



6 


7 




5 


7 




5 


6 






- 3 


8 




+ 4 






9 


1 




1 


8 


9 



2(6-63) - 3(5-56) + 4(45-48) = 27 



0 -4 

1 -1 

= 2(-20 + 2) - 3(0-2) - 4(0 + 4) = -46 

As n increases, the number of terms in the determinant becomes astronomical. Accord- 
ingly, we use indirect methods to evaluate determinants rather than its definition. In fact 
we prove a number of properties about determinants which will permit us to shorten the 
computation considerably. In particular, we show that a determinant of order n is equal 
to a linear combination of determinants of order n — 1 as in case n = 3 above. 



(ii) 



Z 3 

0 -4 



= 2 





0 2 




- 3 


1 5 


+ (-4) 



PROPERTIES OF DETERMINANTS 

We now list basic properties of the determinant. 
Theorem 8.1: The determinant of a matrix A and its transpose A* are equal: \A\ = |A'|. 
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By this theorem, any theorem about the determinant of a matrix A which concerns the 
rows of A will have an analogous theorem concerning the columns of A. 

The next theorem gives certain cases for which the determinant can be obtained 
immediately. 

Theorem 8.2: Let A be a square matrix. 

(i) If A has a row (column) of zeros, then |A| = 0. 

(ii) If A has two identical rows (columns), then \A\ = 0. 

(iii) If A is triangular, i.e. A has zeros above or below the diagonal, then 
|A| = product of diagonal elements. Thus in particular, |/| = 1 where 
/ is the identity matrix. 

The next theorem shows how the determinant of a matrix is affected by the "elementary" 
operations. 

Theorem 8.3: Let B be the matrix obtained from a matrix A by 

(i) multiplying a row (column) of A by a scalar k; then \B\ = k \A\. 

(ii) interchanging two rows (columns) of |A| ; then \B\ = — |A|. 

(iii) adding a multiple of a row (column) of A to another; then \B\ = |A|. 

We now state two of the most important and useful theorems on determinants. 

Theorem 8.4: Let A be any n-square matrix. Then the following are equivalent: 

(i) A is invertible, i.e. A has an inverse A -1 . 

(ii) A is nonsingular, i.e. AX — 0 has only the zero solution, or rank A = n, 
or the rows (columns) of A are linearly independent. 

(iii) The determinant of A is not zero: |A| ^ 0. 

Theorem 8.5: The determinant is a multiplicative function. That is, the determinant of 
a product of two matrices A and B is equal to the product of their deter- 
minants: \AB\ = \A\\B\. 

We shall prove the above two theorems using the theory of elementary matrices (see 
page 56) and the following lemma. 

Lemma 8.6: Let E be an elementary matrix. Then, for any matrix A, \E A\ = \E\\A\. 

We comment that one can also prove the preceding two theorems directly without 
resorting to the theory of elementary matrices. 



MINORS AND COFACTORS 

Consider an w-square matrix A = (oij). Let M« denote the (n- l)-square submatrix of 
A obtained by deleting its ith row and j'th column. The determinant |Af«| is called the minor 
of the element a« of A, and we define the cof actor of a«, denoted by A«, to be the "signed" 

minor: . . , 

A« = (-1) ,+, |M«| 

Note that the "signs" accompanying the minors form a chessboard pattern with 

+'s on the main diagonal: 

/+- + -■■ 

- + -+■• 

+ - + -■• 



We emphasize that Ma denotes a matrix whereas A« denotes a scalar. 
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Example 8.11: Let A = 




2 3 4 



Then M 2 



5 


6 


7 


8 


9 


1 



- (*. :) 



and 



= (-D 



2+3 



2 3 
8 9 



= -(18-24) = 6 



The following theorem applies 
Theorem 8.7 



The determinant of the matrix A = (<&«) is equal to the sum of the products 
obtained by multiplying the elements of any row (column) by their re- 
spective cofactors: n 

\A\ - a tl A n + aaAu + • ■ ■ + a in A in = 2 a «^« 



and 



\A\ = aijAu + a 2 jA 2 i + 



+ a„jA n j — 2 a uAij 



The above formulas, called the Laplace expansions of the determinant of A by the ith 
row and the jth column respectively, offer a method of simplifying the computation of \A\. 
That is, by adding a multiple of a row (column) to another row (column) we can reduce A 
to a matrix containing a row or column with one entry 1 and the others 0. Expanding by 
this row or column reduces the computation of \A\ to the computation of a determinant of 
order one less than that of \A\. 

5 4 2 l\ 
2 3 1 —2 

Example 8.12: Compute the determinant of A = 



-5 -7 -3 
1 -2 -1 



Note that a 1 appears in the second row, third column. Perform the following 
operations on A, where fi { denotes the t'th row: 

(i) add -2R 2 to R lt (ii) add 3B 2 to R s , (iii) add 1R 2 to # 4 . 

By Theorem 8.3(iii), the value of the determinant does not change by these opera- 
tions; that is, 



\A\ = 



5 4 2 1 
2 3 1-2 
-5 -7 -3 9 
1-2-1 4 



1 -2 

2 3 
1 2 

3 1 



0 5 

1 -2 
0 3 
0 2 



Now if we expand by the third column, we may neglect all terms which contain 0. 
Thus 

1 -2 0 5 

2 3 1-2 
12 0 3 

3 10 2 



\A\ = (-1)2+3 



-"{ 



(-2) 



+ 5 



38 



CLASSICAL ADJOINT 

Consider an n-square matrix A = (a«) over a field K: 

I On ai2 

0,21 CL2Z 



A = 



din 



\ fflnl <In2 
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The transpose of the matrix of cofactors of the elements an of A, denoted by adj A, is called 

the classical adjoint of A: 

'An A21 ... Anl 

adj A ,Al2 A * 2 ■■■ Anl 



.A in A211 ... Ai 

We say "classical adjoint" instead of simply "adjoint" because the term adjoint will be used 
in Chapter 13 for an entirely different concept. 



'2 3 -4\ 

Example 8.13: Let A = | 0 — 4 2 

a -1 5, 



The cofactors of the nine elements of A are 



A 2 i 
A31 



= + 



= + 



-4 2 
-1 5 

3 -4 
-1 5 

3 -4 
-4 2 



18, A 12 — 
-11, A 22 = + 
-10, A 32 = - 



0 2 

1 5 

2 -4 

1 5 

2 -4 
0 2 



= 2, A 18 = + 
- 14, A 23 = - 
= -4, A 33 = + 



0 -4 

1 -1 

2 3 

1 -1 

2 3 
0 -4 



= 4 
= 5 
= -8 



We form the transpose of the above matrix of cofactors to obtain the classical adjoint 
of A: 

/-18 -11 -10 
adj A = 2 14 -4 

\ 4 5-8, 



Theorem 8.8: 



For any square matrix A, 

A • (adj A) = (adj A) -A = |A|7 
where / is the identity matrix. Thus, if |A| ¥* 0, 

Observe that the above theorem gives us an important method of obtaining the inverse 
of a given matrix. 

Example 8.14: Consider the matrix A of the preceding example for which |A| = —46. We have 

/2 3 -4 
A (adj A) = 0 -4 2 
\1 -1 5 




/ 


-46 


0 


o\ 




fl 


0 o\ 




0 


-46 


0 


1 — 


0 


1 0 


\ 


0 


0 


-46/ 




\° 


0 1/ 










= -462 




\A\I 



We also have, by Theorem 8.8, 

/-18/-46 -11/-46 -10/-46' 

A-i = |jT(adjA) = 2/-46 14/-46 -4/-46 

\ 4/-46 5/-46 -8/-46, 



9/23 11/46 5/23 \ 
-1/23 -7/23 2/23 
^-2/23 -5/46 4/23/ 



APPLICATIONS TO LINEAR EQUATIONS 

Consider a system of n linear equations in n unknowns: 

auXi + auX2 + • • • + amXn = &i 

U2lXi + a%iX% + • • • + aznXn = &2 
+ a nn Xn — b n 



a n iXi + a n %xi + 
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Let A denote the determinant of the matrix A = (<&«) of coefficients: A = \A\. Also, let As 
denote the determinant of the matrix obtained by replacing the ith column of A by the 
column of constant terms. The fundamental relationship between determinants and the 
solution of the above system follows. 



Theorem 8.9: 



The above system has a unique solution if and only if A ¥> 0. 
the unique solution is given by 



In this case 



Xi = 



Ai 
A' 



A 2 

X2 = X , 



X - ^ 
A 



The above theorem is known as "Cramer's rule" for solving systems of linear equations. 
We emphasize that the theorem only refers to a system with the same number of equations 
as unknowns, and that it only gives the solution when A ¥= 0. In fact, if A = 0 the theorem 
does not tell whether or not the system has a solution. However, in the case of a homo- 
geneous system we have the following useful result. 

Theorem 8.10: The homogeneous system Ax — 0 has a nonzero solution if and only if 

A = |A| = 0. 



f 2a; — &y = 7 
Example 8.15: Solve, using determinants: < 

[3x + hy = 1 



First compute the determinant A of the matrix of coefficients: 

2 -3 

3 5 

Since A # 0, the system has a unique solution. We also have 



A = 



10 + 9 = 19 



7 -3 
1 5 



= 38, Aj, = 



-19 



Accordingly, the unique solution of the system is 

A 



X — 



38 _ „ 
19 - 2 ' 



A » -19 

y = T = IT = - 1 



We remark that the preceding theorem is of interest more for theoretical and historical 
reasons than for practical reasons. The previous method of solving systems of linear equa- 
tions, i.e. by reducing a system to echelon form, is usually much more efficient than by using 
determinants. 



DETERMINANT OF A LINEAR OPERATOR 

Using the multiplicative property of the determinant (Theorem 8.5), we obtain 
Theorem 8.11: Suppose A and B are similar matrices. Then \A\ = \B\. 

Now suppose T is an arbitrary linear operator on a vector space V. We define the 
determinant of T, written det (T), by 

det(T) = \[T] e \ 

where [T] e is the matrix of T in a basis {e*}. By the above theorem this definition is in- 
dependent of the particular basis that is chosen. 

The next theorem follows from the analogous theorems on matrices. 

Theorem 8.12: Let T and S be linear operators on a vector space V. Then 

(i) det (S o T) = det (S) ■ det (T), 

(ii) T is invertible if and only if det(T)^0. 
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We also remark that det (lv) = 1 where lv is the identity mapping, and that det (T -1 ) = 
det(r)- 1 if T is invertible. 



Example 8.16: Let T be the linear operator on R 3 defined by 

T(x, y, z) = (2* — 4y + z, x — 2y + 3z, 5x + y — z) 

f 2 -4 1^ 

The matrix of T in the usual basis of R 3 is [T] = I 1 -2 3 I . Then 



det(r) = 



2-4 1 
1-2 3 
5 1-1 



,5 1-1/ 



2(2 - 3) + 4(-l - 15) + 1(1 + 10) = -55 



MULTILINEARITY AND DETERMINANTS 

Let cA denote the set of all n-square matrices A over a field K. We may view A as an 
n-tuple consisting of its row vectors A u A 2 , . . . , A n : 

A = (Ai, A 2 , . . ., A n ) 
Hence cA may be viewed as the set of n-tuples of w-tuples in K: 

cA = {K n Y 

The following definitions apply. 

Definition: A function D:c4-*K is said to be multilinear if it is linear in each of the 
components; that is: 

(i) if row Ai = B + C, then 

D(A) = D(...,B + C, ...) = D(...,B, ...) + D(...,C, . . .); 

(ii) if row Ai = kB where k GK, then 

D(A) = D(...,kB, ...) = kD(...,B, ...). 
We also say n-linear for multilinear if there are n components. 

Definition: A function D\oA-*K is said to be alternating if D(A) = 0 whenever A has 
two identical rows: 

D(Ai,A 2 , ...,A„) = 0 whenever A 4 = A } , i¥*j 
We have the following basic result; here / denotes the identity matrix. 
Theorem 8.13: There exists a unique function D:c4 ■* K such that: 

(i) D is multilinear, (ii) D is alternating, (iii) D(I) = 1. 

This function D is none other than the determinant function; that is, for 
any matrix A G cA, D(A) = \A\. 
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Solved Problems 



COMPUTATION OF DETERMINANTS 

8.1. Evaluate the determinant of each matrix: (i) 



3 -2 

4 5 



(ii) 



a — b a 1 
a a + b , 



(i) 



3 -2 

4 5 



= 3-5 - (-2) -4 = 23. (ii) 



a — b a 
a a+b 



= (a-b)(a+b) - a' a - -b 2 . 



8.2. Determine those values of k for which 



k k 
4 2k 

k = 2, the determinant is zero. 



k k 
4 2k 



= 0. 



= 2fc2 - 4fc = 0, or 2k(k - 2) = 0. Hence k = 0; and A; = 2. That is, if fe = 0 or 



8.3. Compute the determinant of each matrix: 




/2 0 1\ 2 0 1\ 

(ii) 4 2 -3 , (iii) (32-3 
\5 3 1/ \-l -3 5/ 



-2 3 




4 


3 




4 


-2 




- 2 




+ 3 






5 -1 




2 


-1 




2 


5 



= 1 



= 1(2-15) - 2(-4-6) + 3(20 + 4) = 79 



= 2 



2 -3 

3 1 



- 0 



4 -3 

5 1 



+ 1 



4 2 

5 3 



= 2(10-9) + l(-9 + 2) = -5 



= 1(6 + 4) = 10 



= 24 



(iv) 



A o o\ 

3 2-4 
\4 1 3y 



(ai 6i Ci 

a 2 62 c 2 

<l3 &3 C3 

be used to obtain the determinant of A: 




Show that the diagrams below can 




Form the product of each of the three numbers joined by an arrow in the diagram on the left, 
and precede each product by a plus sign as follows: 

+ a 1 b 2 c 3 + 61C2O3 + Ci a 2&3 
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Now form the product of each of the three numbers joined by an arrow in the diagram on the 
right, and precede each product by a minus sign as follows: 

— 0362c! — 6 3 c 2 a! — c z a 2 b x 
Then the determinant of A is precisely the sum of the above two expressions: 
% &! c x 

\A\ = a 2 b 2 c 2 = a 1 b 2 c s + &ic 2 a 3 + c 1 a 2 b s — a 3 b 2 c 1 — b^c^ — c 3 a 2 b 1 
a 3 63 c 3 

The above method of computing \A\ does not hold for determinants of order greater than 3. 



8.5. Evaluate the determinant of each matrix: 



(i) 





(iii) 



/ 8 2 - 
1 0 -2 

\-2 3 3/ 



(i) Expand the determinant by the second column, neglecting terms containing a 0: 



2 0-1 

3 0 2 
4-3 7 



= -(-3) 



2 -1 

3 2 



= 3(4 + 3) = 21 



(ii) Use the method of the preceding problem: 
a b c 



cab 
b c a 



= a 3 + 6 s + c 3 — 06c — abc — abo = a 3 + 6 s + c 3 



3abc 



(iii) Add twice the first column to the third column, and then expand by the second row: 

3 2 -4 3 2 -4 + 2(3) 3 2 2 

10-2= 10 -2 + 2(1) = 1 0 0 1 
-2 3 3 -2 3 3 + 2(-2) -2 3 -1 



2 2 

3 -1 



= 8 



8.6. Evaluate the determinant of A = 



First multiply the first row by 6 and the second row by 4. Then 






3 -6 -2 




3 


-6 + 4(3) 


-2 - (3) 




3 


6 


-5 


6 - 4 |A| = 24|A| - 


3 2-4 




1 


2 + 4(3) 


-4 - (3) 




1 


14 


-7 




1-4 1 




1 


-4 + 4(1) 


1-(1) 




1 


0 


0 



= + 



6 -5 
14 -7 



= 28, and \A\ = 28/24 = 7/6. 




8.7. Evaluate the determinant of A = 



Note that a 1 appears in the third row, first column. Apply the following operations on A 
(where R { denotes the ith row): (i) add — 2fl 3 to R lt (ii) add 2R S to R 2 , (iii) add 1R S to i? 4 . Thus 
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|A| = 



2 5 


-3 


-2 


-2 -3 


2 


-5 


1 3 


-2 


2 


-1 -6 


4 


3 


-1 + 1 


1 


-6 


3-2 


-2 


-1 


-3 + 2 


2 


5 



0-1 1-6 

0 3-2-1 

1 3-2 2 
0-325 



= + 



0 10 

1 -2 -13 
-1 2 17 



-1 1 -6 
3 -2 -1 
-3 2 5 



1 

-1 



-13 
17 



= -4 



8.8. Evaluate the determinant of A = 



I 3 -2 -5 4^ 
-5 2 8-5 
-2 4 7-3 
\ 2 -3 -5 8/ 

First reduce A to a matrix which has 1 as an entry, such as adding twice the first row to the 
second row, and then proceed as in the preceding problem. 



\A\ = 





3 


-2 


-5 


4 




5 


2 


8 


-5 




2 


4 


7 - 


-3 




2 


-3 


-5 


8 




3 


-2 


-5 


4 




1 


-2 


-2 


3 




2 


4 


7 - 


-3 




2 


-3 


-5 


8 




3 


4 


1 - 


-5 




1 


0 


0 


0 




2 


0 


3 


3 


2 


1 


-1 


2 




4 


1 


-6 






0 


3 


0 






1 


-1 


3 







3 




-2 


-5 




4 




-5 + 2(3) 2 + 2(-2) 


8 + 2(- 


5) ■ 


-5 + 2(' 




-2 




4 


7 




-3 




2 




-3 


-5 




8 




3 - 


2 + 2(3) -5 + 2(3) 


4 


-3(3) 




1 - 


2 + 2(1) -2 + 2(1) 


3 


-3(1) 




-2 


4 + 2(- 


-2) 7 + 2(-2) 


-3 


-3(-2) 




2 - 


3 + 2(2) -5 + 2(2) 


8 


-3(2) 




4 


1 -5 




4 1 


-5 


-d) 




0 


3 3 




0 3 


3 


-(3) 




1 - 


1 2 




1 -1 


2 


-(-1) 


4 


-6 
3 












1 




-3(12 + 6) = - 


-54 





8.9. Evaluate the determinant of A = 



It + 3 -1 1 

5 t-Z 1 

6 -6 t + 4/ 

Add the second column to the first column, and then add the third column to the second column 
to obtain 



\A\ = 



t + 2 0 1 
t + 2 t-2 1 
0 t-2 t+4 



Now factor t + 2 from the first column and t — 2 from the second column to get 

10 1 

|A| = (t + 2)(t-2) 11 1 
0 1 t + 4 

Finally subtract the first column from the third column to obtain 

10 0 

11 0 
0 1 t + 4 



\A\ = (t + 2)(t-2) 



= (t+2)(t-2)(t + 4) 
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COFACTORS 

8.10. Find the cofactor of the 7 in the matrix 




2 


1 


-3 


4 


5 


-4 


7 


-2 


4 


0 


6 


-3 


3 


-2 




2 





2 


1 


4 




2 


1 


4 




4 


0 


-3 




4 


0 


-3 




3 


-2 


2 




7 


0 


10 



/_ 4 -M 

V 7 10/ 



61 



The exponent 2 + 3 comes from the fact that 7 appears in the second row, third column. 



1 2 3\ 

8.11. Consider the matrix A = 2 3 4 . (i) Compute |A|. (ii) Find adj A. (iii) Verify 

\1 5 7/ 
A • (adj A) = |A| /. (iv) Find A -1 . 



(i) |A| = 1 




= 1-20 + 21 = 2 



1 


-10 


7\ 


t 


I 1 


i 


1 


4 


-3 




-10 


4 


1 


2 


-1/ 




V i 


-3 



That is adj A is the transpose of the matrix of cof actors. Observe that the "signs" in the 

/+ - +\ 

matrix of cofactors form the chessboard pattern — + - 

\+ - +1 



(iii) A ' (adj A) = 




0 


o\ 




/I 


0 




2 


0 


I-. 


0 


1 


3 


0 


ay 




\o 


0 






= \A\I 



fa b 

8.12. Consider an arbitrary 2 by 2 matrix A = I rf 



(i) Find adj A. (ii) Show that adj (adj A) = A. 

(x) a^ a _ (_ |6| +|a| j - (_ 6 a ; - a ; 

/ d -b\ / +|a| — I— _ /* eV _ /" 6 \ _ 

(ii) adj (adj A) = adj(^_ c fl j = (_ h6| ^ - ^ dy ) - [ e d ) 



= A 
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DETERMINANTS AND SYSTEMS OF LINEAR EQUATIONS 
8.13. Solve for x and y, using determinants: 



(i) 



2x + y = 7 
Sx — 5y — 4 



(i) A = 



2 1 

3 -5 



= -13, A x = 



J/ = A u /A = 1. 



(ii) A = 



= ab, A x 



a -26 
3a -56 
—c/a, y = A u /A = —c/b. 



ax — 2by = c 
Sax — 5by — 2c 

= -39, A y = 



7 1 

4 -5 



where a& ?* 0. 

= -13. Then a: = A x /A 3, 



2 7 

3 -4 



c -26 
2c -56 



= —6c, A y = 



a c 
3a 2c 



= — ac. Then a: = A x /A = 



8.14. Solve using determinants: 



'3y + 2x = 2 + 1 
Sx + 2z - 8 - by . 
3z - 1 = x - 2y 

First arrange the system in standard form with the unknowns appearing in columns: 

2x + 3y - z = 1 
Sx + 5y + 2z = 8 
x - 2y - 3z = -1 

Compute the determinant A of the matrix A of coefficients: 
2 3-1 

2(-15 + 4) - 3(-9 - 2) - l(-6 - 5) = 22 



A = 



3 5 2 
1 -2 -3 



Since A # 0, the system has a unique solution. To obtain A x , A y and A,,, replace the coefficients of 
the unknown in the matrix A by the column of constants. Thus 



A, = 



1 3-1 
8 5 2 
-1 -2 -3 



66, 



2 1-1 

3 8 2 
1 -1 -3 



= -22, A, = 



and x = AjA = 3, y = A y /A = -1, z = AjA = 2. 



2 3 1 

3 5 8 
1 -2 -1 



= 44 



PROOF OF THEOREMS 

8.15. Prove Theorem 8.1: |A*| = |A|. 

Suppose A = (a y ). Then A* = (6 y ) where 6 y = a 3i . Hence 

= 2 (sgn a) 6 1(7 . a) 6 2(T(2) . . . 6 n0 . (n) 
aes, 

= 2 (sgn a) a aWA a„ w ,2 ■ ■ ■ <Vcn),n 
aes„ 

Let r = <r _1 . By Problem 8.36, sgn r = sgn a, and 

a <r(U,l °o-C2),2 ■ • • a <7(n),n = °1t(1) °2t(2) • • ■ "nrW 

Hence |A*| = 2 (sgn t) a lT(1) a 2T(2) . . . a BT(n) 

However, as a runs through all the elements of S B , r = <r _1 also runs through all the elements of 
S n . Thus |A*| = \A\. 



8.16. Prove Theorem 8.3(H): Let B be obtained from a square matrix A by interchanging 
two rows (columns) of A. Then |Z?| = — |A|. 

We prove the theorem for the case that two columns are interchanged. Let t be the trans- 
position which interchanges the two numbers corresponding to the two columns of A that are 
interchanged. If A = (a y ) and B — (6^), then 6 y = a iTljy Hence, for any permutation a, 
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&l<r(l) &2<K2) • ■ • W) ~ a lTO-(l) a 2T<r(2) • ■ • tt »w(«) 

Thus \B\ = 2 (sgna)^ 

o-es„ 

= 2 (sgn a) a lT(rC1) a 2 To-(2) • • • a nro(ii) 

<TGS. 

Since the transposition t is an odd permutation, sgn to- = sgn r • sgn a — — sgn <r. Thus sgn a = 
— sgn to, and so 

= — 2 (sgn ™) u lTO(1 ) % x „( 2) . . . tt nro ( B ) 
o-es„ 

But as a runs through all the elements of S n , to also runs through all the elements of S n ; hence 

\B\ = -\A\. 

8.17. Prove Theorem 8.2: (i) If A has a row (column) of zeros, then |A| = 0. (ii) If A has 
two identical rows (columns), then |A| = 0. (iii) If A is triangular, then |A| = product 
of diagonal elements. Thus in particular, |7| = 1 where I is the identity matrix. 

(i) Each term in |A| contains a factor from every row and so from the row of zeros. Thus each 
term of |A| is zero and so |A| = 0. 

(ii) Suppose 1 + 1 # 0 in K. If we interchange the two identical rows of A, we still obtain the 
matrix A. Hence by the preceding problem, |A| = — |A| and so |A| = 0. 

Now suppose 1 + 1 = 0 in K. Then sgn a = 1 for every o e S n . Since A has two iden- 
tical rows, we can arrange the terms of A into pairs of equal terms. Since each pair is 0, the 
determinant of A is zero. 

(iii) Suppose A = (%) is lower triangular, that is, the entries above the diagonal are all zero: 

= 0 whenever i < j. Consider a term * of the determinant of A: 

t = (sgn <r) o ltl .. . o nin , where a = ifa ■ • • i n 

Suppose i x ¥> 1. Then 1< i t and so a Hl = 0; hence t = 0. That is, each term for which 
i t ¥= 1 is zero. 

Now suppose i t = 1 but i 2 ¥= 2. Then 2 < i 2 and so = 0; hence t = 0. Thus each 
term for which tj »* 1 or i 2 »* 2 is zero. 

Similarly we obtain that each term for which ij ^ 1 or % # 2 or ... or i„ ¥= n is zero. 
Accordingly, \A\ = a n a 22 ■ ■ ■ «nn = product of diagonal elements. 

8.18. Prove Theorem 8.3: Let B be obtained from A by 

(i) multiplying a row (column) of A by a scalar k; then |B| = k \A\ . 

(ii) interchanging two rows (columns) of A; then |B| = - |A|. 

(iii) adding a multiple of a row (column) of A to another; then |B| = \A\. 

(i) If the jth row of A is multiplied by k, then every term in |A| is multiplied by fc and so 
|JS| = k\A\. That is, 

|B| = 2 (sgn <r) a u Ozi 2 • • ■ ■ ■ ■ a m n 

= fc 2 (sgn or) a lf a 2 i 2 • • • a m n = k \ A \ 

(ii) Proved in Problem 8.16. 

(iii) Suppose c times the fcth row is added to the jth row of A. Using the symbol /\ to denote the 
jth position in a determinant term, we have 

\B\ = 2 (sgn a) a u ^ - . . (ca fcifc + o^) . . . a Bin 

= c 2 (sgn a) o Ml ^ . . . a fcifc . . . a nin + 2 (sgn a) a 1H . . . a Hj . . . a nin 

The first sum is the determinant of a matrix whose feth and ;'th rows are identical; 
hence by Theorem 8.2(h) the sum is zero. The second sum is the determinant of A. Thus 
|B| = cO + |A| = A. 
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8.19. Prove Lemma 8.6: For any elementary matrix E, \EA\ = \E\ \A\. 

Consider the following elementary row operations: (i) multiply a row by a constant k # 0; 
(ii) interchange two rows; (iii) add a multiple of one row to another. Let E v E 2 and E 3 be the 
corresponding elementary matrices. That is, E lt E 2 and E s are obtained by applying the above 
operations, respectively, to the identity matrix /. By the preceding problem, 

\E t \ = k\I\ = k, \E 2 \ = = -1, \E 3 \ = \I\ = 1 

Recall (page 56) that E { A is identical to the matrix obtained by applying the corresponding 
operation to A. Thus by the preceding problem, 

\E t A\ = k\A\ = 1^1 \A\, \EzA\ = -\A\ = \E 2 \\A\, \E 3 A\ = \A\ = 1\A\ = \E S \\A\ 
and the lemma is proved. 



8.20. Suppose B is row equivalent to A; say B = E n E n -i . . . E 2 E X A where the E { are 
elementary matrices. Show that: 

(i) \B\ = \En\ \E n -!\ . . . \E%\ |#i| \A\, (ii) \B\ ^ 0 if and only if \A\ ^ 0. 

(i) By the preceding problem, \E^A\ = \E t \ \A\ . Hence by induction, 

\B\ = \E n \\E n _ 1 ...E 2 E l A\ = \E n \ \B n - t \ . . . |2? 2 | 1^1 \A\ 

(ii) By the preceding problem, E i ^ 0 for each i. Hence \B\ ¥> 0 if and only if |A| ^ 0. 



8.21. Prove Theorem 8.4: Let A be an n-square matrix. Then the following are equivalent: 
(i) A is invertible, (ii) A is nonsingular, (iii) \A\ ^ 0. 

By Problem 6.44, (i) and (ii) are equivalent. Hence it suffices to show that (i) and (iii) are 
equivalent. 

Suppose A is invertible. Then A is row equivalent to the identity matrix /. But |/| ¥° 0; hence 
by the preceding problem, \A\ 0. On the other hand, suppose A is not invertible. Then A is row 
equivalent to a matrix B which has a zero row. By Theorem 8.2(i), \B\ — 0; then by the preceding 
problem, |A| =0. Thus (i) and (iii) are equivalent. 

8.22. Prove Theorem 8.5: \AB\ = \A\\B\. 

If A is singular, then AB is also singular and so \AB\ - 0 = |A| \B\. On the other hand if A 
is nonsingular, then A = E n . . . E 2 E t , a product of elementary matrices. Thus, by Problem 8.20, 

]A| = |* B .. = \E K \ I^H/I = |£" n | ... ItfJI^I 

and so \AB\ = \E n ...E 2 E l B\ = \E n \ . . . \E 2 \ ^ \B\ = \A\\B\ 

8.23. Prove Theorem 8.7: Let A — (an); then \A\ = anAn + aaAm + ■ • • + a in A in , where 
An is the cofactor of a«. 

Each term in |A| contains one and only one entry of the ith row (a^a^, . . ., a in ) of A. Hence 
we can write | A | in the form 

|A| = oaAfi + ai2 A* 2 + ■■■ + a in Af n 

(Note Ay is a sum of terms involving no entry of the ith row of A.) Thus the theorem is proved if 
we can show that 

At. = A i3 - = (-D'+ilMyl 

where M {j is the matrix obtained by deleting the row and column containing the entry Oy. (His- 
torically, the expression A* iS was defined as the cofactor of a i} , and so the theorem reduces to showing 
that the two definitions of the cofactor are equivalent.) 
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First we consider the case that i — n, j = n. Then the sum of terms in |A| containing a nn is 

UmAnn ~ a nn 2 (sgn a) a 1(ra -, 0 2 <t(2) ■ • • a n-l,cr(n-l) 

where we sum over all permutations a e S n for which a(n) = n. However, this is equivalent (Prob- 
lem 8.63) to summing over all permutations of {1, . . .,n-l>. Thus A* nn - \M nn \ = |M BB |. 

Now we consider any i and j. We interchange the ith row with each succeeding row until it is 
last, and we interchange the jth column with each succeeding column until it is last. Note that the 
determinant |My| is not affected since the relative positions of the other rows and columns are not 
affected by these interchanges. However, the "sign" of \A\ and of Ay is changed n — i and then 
n — j times. Accordingly, 

A% = (-l)»-i+«-J |M„| = (-D i+ > |M„| 

8.24. Let A = (ay) and let B be the matrix obtained from A by replacing the ith row of A 
by the row vector (b a , .... b in ). Show that 

\B\ = buAa + bizA® + • • ■ + bi„A{„ 
Furthermore, show that, for j ¥= i, 

CLjlAa + djiAii + • • • + djnAin — 0 

and aijAn + a 2 jA 2 i + ■ • • + a„jA ni = 0 

Let B = (6 i3 ). By the preceding problem, 

\B\ = b a B a + bisBv + • • • + 6 jn B jn 

Since B y does not depend upon the ith row of B, B i} = A i} for j = 1 n. Hence 

\B\ = 6 i iA jl + b^Au + ••■ + b in A in 

Now let A' be obtained from A by replacing the ith row of A by the ;*th row of A. Since A' 
has two identical rows, \A'\ = 0. Thus by the above result, 

|A'| = OjiAjj + %A i2 + • • • + a jn A in = 0 
Using \A*\ - \A\, we also obtain that a u A u + a^A^ + •• • + a nj A ni = 0. 

8.25. Prove Theorem 8.8: A-(adjA) = (adjA)-A = |A|7. Thus if |A| ¥■ 0, A -1 = 
(l/|A|)(adjA). 

Let A = (a {j ) and let A • (adj A) = (6y). The ith row of A is 

(« n , a a , a in ) (1) 

Since adj A is the transpose of the matrix of cofactors, the jth column of adj A is the transpose of 
the cofactors of the ;th row of A: 

(A 3l ,A j2 , .... A jn V (2) 
Now by, the ij'-entry in A • (adj A), is obtained by multiplying (1) and (2): 

6 fj = a a A n + a^Aji + • • ■ + a in A jn 

Thus by Theorem 8.7 and the preceding problem, 

_ j\A\ if i = j 
hi - \0 if i*i 

Accordingly, A -(adj A) is the diagonal matrix with each diagonal element |A|. In other words, 
A «(adj A) = \A\1. Similarly, (adj A) -A = \A\1. 
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8.26. Prove Theorem 8.9: The system of linear equations Ax — h has a unique solution 
if and only if A = \A\ ¥= 0. In this case the unique solution is given by Xi = Ai/A, 

X2 = A 2 /A, . . ., x„ = An/A. 

By preceding results, Ax = b has a unique solution if and only if A is invertible, and A is 
invertible if and only if A = \A\ 0. 

Now suppose A#0. By Problem 8.25, A~ x = (1/A)(adj A). Multiplying Ax - b by A -1 , we 
obtain 

x = A-iAx = (1/A)(adj A)b (1) 
Note that the ith row of (l/A)(adjA) is (l/A)(A li( A 2i , . . ., A ni ). If 6 = (&„ fe 2 , . . ., b n )* then, by (1), 

x { = (1/A)(6, A u + M 2i + • • ■ + 6„A ni ) 

However, as in Problem 8.24; 

Mii + M 2i + • • • + b n A ni = A ; 

the determinant of the matrix obtained by replacing the ith column of A by the column vector 6. 
Thus x { = (l/A)Ai, as required. 

8.27. Suppose P is invertible. Show that IP" 1 ] = \P\~K 

P-iP = 7. Hence 1 = |/| = \P~ip\ = \p-i\ \P\, and so |p-i| = 

8.28. Prove Theorem 8.11: Suppose A and B are similar matrices. Then |A| = |£|. 

Since A and B are similar, there exists an invertible matrix P such that B = P-1AP. Then 
by the preceding problem, \B\ = |P"iAP| = |P~»| |A| \P\ = \A\ |P~i| |P| = |A|. 

We remark that although the matrices P _1 and A may not commute, their determinants |P _1 | 
and |A| do commute since they are scalars in the field K. 

8.29. Prove Theorem 8.13: There exists a unique function D-.cA + K such that (i) D is 
multilinear, (ii) D is alternating, (iii) D(I) = 1. This function D is the determinant 
function, i.e. D(A) = \A\. 

Let D be the determinant function: D(A) = \A\. We must show that D satisfies (i), (ii) and (iii), 
and that D is the only function satisfying (i), (ii) and (iii). 

By preceding results, D satisfies (ii) and (iii); hence we need show that it is multilinear. Suppose 
A = (««) = (A u A 2 , . . ., A n ) where A k is the feth row of A. Furthermore, suppose for a fixed i, 

Ai = Bi + Ci, where B t = (6 t b n ) and C { = (c 1( . . ., c„) 

Accordingly, ttil = 6! + ^, a i2 = & 2 + c 2 , o in = 6„ + c n 

Expanding Z)(A) = |A| by the ith row, 

Z?(A) = 2>(A, B t + C„ . . ., A„) = a u A u + a j2 A j2 + • • ■ + a in A jn 

= (&! + Cl )A u + (6 2 + c 2 )A i2 + ... +{b n + c „)A in 

= (ftiAi! + b 2 A i2 +■■•+ 6 B A iB ) + (CiAj! + C2 A, 2 + • • • + c n A in ) 

However, by Problem 8.24, the two sums above are the determinants of the matrices obtained from 
A by replacing the ith row by B t and Cj respectively. That is, 

D(A) = D(A„ ...,B, + C I ,...,A B ) 

= "(Al . . .,Bt, . . .,A„) + D(A„ . . ..d, . . ., A n ) 

Furthermore, by Theorem 8.3(i), 

D(Ai kA t A„) = feC(A, A { , ...,A„) 

Thus D is multilinear, i.e. D satisfies (iii). 
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We next must prove the uniqueness of D. Suppose D satisfies (i), (ii) and (iii). If {e lt . . . , e n } 
is the usual basis of K n , then by (iii), D(e u e 2 , ...,e n ) = D(I) = 1. Using (ii) we also have (Problem 
8.73) that 

D(e fl ,e i2 , ...,e in ) = sgna, where a = t^ 2 . . . i„ (1) 
Now suppose A = (ay). Observe that the fcth row A k of A is 

A k = («kl. a k2> ••■> a kn) = °fcl e l + a k2 e 2 + ' " " + «-kn e n 

Thus f (A) = D(a n e t H 1- a ln e„, H 1- a 2n e n , a nl e t + • • • + a nn e n ) 

Using the multilinearity of D, we can write D(A) as a sum of terms of the form 

D(A) = 1 D(a lh e {i , a 2i2 e i2 , ...,a ni eQ 

= 2 («H, «2i 2 • • • <»ni n ) «i 2 ' • ■ ■' e i tl ) 

where the sum is summed over all sequences i t i 2 . . . i n where i k £ {1, . . . , w}. If two of the indices 
are equal, say i s = i k but j ¥> k, then by (ii), 

•••> e in ) = 0 

Accordingly, the sum in (2) need only be summed over all permutations a = . . . i n . Using (1), 
we finally have that 

D(A) = 2 ("iii <hi 2 ■ ■ • a-m n ) D(e h , 

= 2 (sgn a) a u a 2l . . . a ni , where a = i x i 2 . . . i n 
Hence D is the determinant function and so the theorem is proved. 

PERMUTATIONS 

8.30. Determine the parity of a = 542163. 
Method 1. 

We need to obtain the number of pairs (i, j) for which i > j and i precedes } in a. 
3 numbers (5, 4 and 2) greater than and preceding 1, 

2 numbers (5 and 4) greater than and preceding 2, 

3 numbers (5, 4 and 6) greater than and preceding 3, 
1 number (5) greater than and preceding 4, 
0 numbers greater than and preceding 5, 
0 numbers greater than and preceding 6. 

Since 3 + 2+ 3 + 1 + 0 + 0 = 9 is odd, a is an odd permutation and so sgn a = 
Method 2. 

Transpose 1 to the first position as follows: 

to 1 5 4 2 6 3 
to 1 2 5 4 6 3 
to 1 2 3 5 4 6 
to 1 2 3 4 5 6 

Note that 5 and 6 are in the "correct" positions. Count the number of numbers "jumped": 
3 + 2 + 3 + 1 = 9. Since 9 is odd, a is an odd permutation. (Remark: This method is essentially 
the same as the preceding method.) 



5 4 2 1 6 3 
Transpose 2 to the second position: 

lTT~2 6 3 
Transpose 3 to the third position: 

1 2*if4~6^3 
Transpose 4 to the fourth position: 

1 2 3V4 6 
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Method 3. 

An interchange of two. numbers in a permutation is equivalent to multiplying the permutation 
by a transposition. Hence transform a to the identity permutation using transpositions; such as, 

5 4 2 A 6 3 

X 

1 4 2 5 6 3 

X 

1 2 4 5 6^3 

1 2 3*6 6^4 

1 2 3 4 6 5 

X 

1 2 3 4 5 6 

Since an odd number, 5, of transpositions was used, a is an odd permutation. 



8.31. Let <r = 24513 and t = 41352 be permutations in S 5 . Find (i) the composition per- 
mutations re, and ofo T) (ii) a~ x . 

Recall that a = 24513 and r = 41352 are short ways of writing 

/l 2 3 4 5\ /l 2 3 4 5\ 

' = ( 2 4 5 1 3) and T = (4 1 3 5 2) 



which means 
and 



<t(1) = 2, <r(2) = 4, <r(3) = 5, <r(4) = 1 and cr(5) = 3 
r(l) = 4, t(2) = 1, t(3) = 3, t(4) = 5 and t(5) = 2 



(i) 12345 12345 

24513 and 41352 

r I I I I j <r J. J. ^ ^ J. 

15243 12534 

Thus r'» = 15243 and o°r = 12534. 

(ii) 



j _ /2 4 5 1 3\ _ /l 2 3 4 5\ 
(^1 2 3 4 5/ ~ ^4 1 5 2 3/ 



That is, (T-i = 41523. 



8.32. Consider any permutation a = j x j 2 . . . j n . Show that for each pair (i, fc) such that 

i > k and i precedes k in ^ 

there is a pair (i*, fe*) such that 

i* < k* and <r(i*) > <r(fc*) (1) 

and vice versa. Thus cr is even or odd according as to whether there is an even or 
odd number of pairs satisfying (1). 

Choose t* and ft* so that <r(i*) = i and a(fc*) = ft. Then i > ft if and only if <r(i*) > a(ft*), 
and i precedes k in a if and only if i* < ft*. 
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8.33. Consider the polynomial g — g(xi, ...,x n ) = Y[(Xi — Xj). Write out explicitly the 
polynomial g — g{xi, x 2 , x s , x*). 

The symbol n is used for a product of terms in the same way that the symbol 2 is used for a 
sum of terms. That is, JI (^i — x j) means the product of all terms (x { — x } ) for which i < j. Hence 

9 - g{%l, • • • . *4> = (*1 — a^X*! — x s) (*1 _ x i)i x 2 - x s)( x 2 ~ x i)( x 3 ~ x i) 



8.34. Let o- be an arbitrary permutation. For the polynomial g in the preceding problem, 
define <r(g) - J~J (avco - x aW ). Show that 

{g if a is even 
-g if <x is odd 

Accordingly, a(g) = (sgn a)g. 
Since a is one-one and onto, 

a(g) = n (* ff a) - = . .Ft (Xi-Xj) 

i < j t < 3 or i > ) 

Thus a(g) — g or a(g) = —g according as to whether there is an even or an odd number of terms 
of the form (x t - x } ) where i > j. Note that for each pair (i, j) for which 

i < j and tr{i) > o(j) (1) 

there is a term (x aU) - x„ w ) in a(g) for which a(t) > a(i). Since a is even if and only if there is an 
even number of pairs satisfying (1), we have o(g) = g if and only if a is even; hence a(g) = —g 
if and only if a is odd. 



8.35. Let <t,t G S n . Show that sgn (to,,) = (sgn r)(sgn cr). Thus the product of two even 
or two odd permutations is even, and the product of an odd and an even permutation 
is odd. 

Using the preceding problem, we have 

sgn(T°<r)0 = (t°j)(j) = r(a(g)) = r((sgn o)g) - (sgn x)(sgn a)g 
Accordingly, sgn(i-°<r) = (sgn r)(sgn <r). 



8.36. Consider the permutation <r = hfa . . . j*. Show that sgn a" 1 = sgn a and, for scalars 
0^10^2 . . . a in n = aikjOskj . . ■ Onk n where a' 1 = kikt . . . k n 

We have a' 1 "a - e, the identity permutation. Since c is even, cr 1 and a are both even or 
both odd. Hence sgn <r _1 = sgn <r. 

Since a — jj 2 ■ ■ ■ in is a permutation, . . . a ln n = a lk[ a 2fc2 ■ • • a„k„- Then k v k 2 , . . . , k n 

have the property that 

ff(fcj) = 1, (r(fc 2 ) = 2, . . ., a(k n ) = n 

Let t = ■■ ■ K- Then for i = 1. • • • » w, 

(cr°r)(i) = a{r(i)) = cikj = i 

Thus a°r — e, the identity permutation; hence t = a -1 . 
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MISCELLANEOUS PROBLEMS 

8.37. Find det (T) for each linear operator T: 

(i) T is the operator on R 3 denned by 

T(x,y,z) = (2x-z,x + 2y~4:Z,3x-3y + z) 

(ii) T is the operator on the vector space V of 2-square matrices over K denned by 

'a b y 
c d, 

1 2 0 -l 1 

(i) Find the matrix representation of T relative to, say, the usual basis: [T] = I 1 2—4 



T(A) = MA where M 



-3 l, 



Then 



det (T) = 



2 0-1 
1 2-4 
3-3 1 



= 2(2-12) - l(-3-6) = -11 



(ii) Find a matrix representation of T in some basis of V, say, 



{-( 



Then 



T(E 2 ) 
T(E 3 ) 
T(E 4 ) 



a 0 



Thus [T] E = 



det (T) = 



a 0 
0 
b 



1 ON 
0 0/ 


. E 2 — 


c :)• 


£3 : 


■G 


(c d)(o o) 


■c 


S) 




(c d)(o o) 




:) 


= OB, 


/a 6' 
\c d 


)G :) 


(d o) 


= bE, 


la b\/0 ON 
\c d){o l) 


(o d) 


= 0E 


c 0\ 










0 c 1 
d 0 


and 








0 dj 
c 0 




a 0 c 




0 a 


0 c 
d 0 
0 d 


= a 


0 d 0 
6 0 d 


+ c 


6 0 
0 6 



= a 2 d2 + 62 C 2 - 2a6cd 



H 1 1\ 

8.38, Find the inverse of A = | 0 1 1 

^0 0 1/ 

The inverse of A is of the form (Problem 8.53): A -1 
Set A A -1 = /, the identity matrix: 




A A -1 = 




/l x + 1 y + z + 1 
=0 1 z+ 1 
\0 0 1 





/I 


0 




H 


0 


1 


•:) 




\o 


0 





Set corresponding entries equal to each other to obtain the system 

* + 1 = 0, y + z + 1 = 0, z + l = 0 



= / 
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1-1 <n 

The solution of the system is x = — 1, y = 0, z = — 1. Hence A -1 = 0 1—1 

\0 0 1, 

A -1 could also be found by the formula A -1 = (adj A)/|A| . 

8.39. Let D be a 2-linear, alternating function. Show that D{A, B) = — D(B, A). More 
generally, show that if D is multilinear and alternating, then 

D(...,A, ...,B, ...) = -£(..., B, .... A, ...) 
that is, the sign is changed whenever two components are interchanged. 
Since D is alternating, D(A + B, A + B) = 0. Furthermore, since D is multilinear, 
0 = D(A+B,A+B) = D(A,A + B) + D(B,A + B) 

= D(A, A) + D(A, B) + D(B, A) + D(B, B) 
But D(A, A) = 0 and D(B, B) = 0. Hence 

0 = D(A, B) + D(B, A) or D(A, B) = - D(B, A) 

Similarly, 0 = D(. . ., A +B, . . ., A + B, . . .) 

= B(...,A, A, ...) + D(...,A B, ...) 

+ D(...,B, ...,A, ...) + D(...,B, ...,B, ...) 
= D(...,A, ...,B, ...) + D(...,B, ...,A, ...) 
and thus D(. . ., A B, . . .) = - B(. . ., B, . . ., A, . . .). 

8.40. Let F be the vector space of 2 by 2 matrices M = ^ j over r. Determine 

whether or not D : V -» R is 2-linear (with respect to the rows) if (i) I>(M) = a + o\ 
(ii) D(M) = ad. 

(i) No. For example, suppose A = (1, 1) and B = (3, 3). Then 

B(A,B) = J) = 4 and B(2A, B) = B (J J) = 5 # 2B(A, B) 

(ii) Yes. Let A = (a 1( a 2 ), B = (6 lt b 2 ) and C=(c 1( c 2 ); then 

B(A,C) = DQ J) = a lC2 and D(B,C) = b(** = V 2 

Hence for any scalars s,(£B, 

0( 8 A + tB,C) = l>/ s<l i + f6 » ™*+ tb A = ( 8ai + t&1 ) C2 
\ c l c 2 / 

= s(o 1 c 2 ) + t(6 lC2 ) = sD(A,C) + tD(B,C) 

That is, B is linear with respect to the first row. 
Furthermore, 

B(C, A) = bQ J) = c l02 and B(C B) = D (* j) = ^ 

Hence for any scalars i,f£K, 

B(C, sA + *B) = B ( C * "2 ) = C!(sa 2 + 16 2 ) 

= s(c x aj) + t{ Cl b 2 ) = sD(C,A) + tD(C,B) 

That is, D is linear with respect to the second row. 
Both linearity conditions imply that D is 2-linear. 
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Supplementary Problems 

COMPUTATION OF DETERMINANTS 

(2 5 \ / 6 1 \ 

4 j, (ii) f 3 2 ) • 

8.42. Compute the determinant of each matrix: (i) /* J" 3 V (ii) 5 7 V 

V ~ 4 4 - 1/ \ -1 t + 3/ 

8.43. For each matrix in the preceding problem, find those values of t for which the determinant is zero. 

8.44. Compute the determinant of each matrix: 

/2 1 l\ /3 -2 -4\ 1-2 -1 4\ 

(i) 0 5 -2 , (ii) 2 5 -1 , (iii) 6-3-2 
\1 -3 4/ \0 6 1/ I 4 1 2/ 

8.45. Evaluate the determinant of each matrix: 

It -2 4 3 
(i) 1 t + 1 -2 
\ 0 0 t-Al 






8.46. For each matrix in the preceding problem, determine those values of t for which the determinant 
is zero. 



8.47. Evaluate the determinant of each matrix: (i) 



Ii 


2 


2 


3 \ 


l 


0 


-2 


0 


3 


-1 


1 


-2 


\4 


-3 


0 


■/ 



, (ii) 




COFACTORS, CLASSICAL ADJOINTS, INVERSES 

1 2-2 3^ 

8.48. For the matrix | 3 * jj ? I , find the cof actor of: 
4 0 2 1 

1 7 2 - 3/ 
(i) the entry 4, (ii) the entry 5, (iii) the entry 7. 

'l 1 0^ 

8.49. Let A = | 1 1 1 | . Find (i) adj A, (ii) A -1 . 
i0 2 1, 



'1 2 2^ 
8.5(1. Let A = | 3 1 0 

a i i, 



Find (i) adj A, (ii) A~K 



8.51. Find the classical adjoint of each matrix in Problem 8.47. 

8.52. Determine the general 2 by 2 matrix A for which A = adj A. 

8.53. Suppose A is diagonal and B is triangular; say, 

/ dx 0 ... 0 

A = 



0 a 2 
0 0 









c 12 • 


• 


and 


B = 


0 


b 2 ■ 


• c 2n 






\0 


0 


■ 6„ / 
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(i) Show that adj A is diagonal and adj B is triangular. 

(ii) Show that B is invertible iff all 6 4 0; hence A is invertible iff all o 4 # 0. 

(iii) Show that the inverses of A and £ (if either exists) are of the form 

h 



d 12 . 






■ d 2n 


0 





That is, the diagonal elements of A -1 and are the inverses of the corresponding diagonal 
elements of A and B. 

DETERMINANT OF A LINEAR OPERATOR 

8.54. Let T be the linear operator on R 3 denned by 

T(x, y, z) = (3x - 2z, 5y + 1z, x + y + z) 

Find det (T). 

8.55. Let D : V -» V be the differential operator, i.e. D(v) = dv/d*. Find det (D) if V is the space gen- 
erated by (i) {l,t, .... t n }, (ii) {e*, e*', e^*}, (iii) {sin t, cos t}. 

8.56. Prove Theorem 8.12: Let T and S be linear operators on V. Then: 

(i) det (S°T} = det (S) • det (?); (ii) T is invertible if and only if det (T) ¥■ 0. 

8.57. Show that: (i) det(l v ) = 1 where l v is the identity operator; (ii) det(7 1 - 1 ) = det (J/) -1 if T is 
invertible. 

DETERMINANTS AND LINEAR EQUATIONS 

f Zx + 5j/ = 8 

8.58. Solve by determinants: (i) ^ , . , (n 

[ Ax — 2j/ = 1 

(2x-5y + 2z = 7 
a + 2y - 4z = 3 (i 
3s — 4j/ — 6z = 5 

8.60. Prove Theorem 8.10: The homogeneous system Ax = 0 has a nonzero solution if and only if 
A = \A\ — 0. 

PERMUTATIONS 

8.61. Determine the parity of these permutations in S 5 : (i) a = 32154, (ii) r = 13 524, (iii) ir = 42 5 31. 

8.62. For the permutations a, t and v in Problem 8.61, find (i) r°a, (ii) v°<r, (iii) (iv) r _1 . 

8.63. Let t € 5„. Show that t ° a runs through S„ as a runs through S„; that is, S n - {r ° a : a e S n }. 

8.64. Let jSS, have the property that a(n) = n. Let a* e S B _t be defined by o*(x) = <r(x). (i) Show 
that sgn <r* = sgn a. (ii) Show that as a runs through S„, where (r(n) = n, c* runs through 
S n _!; that is, S B _! = {<r* : <r S S n , <r(«) = n}. 

MULTILINEARITY 

8.65. Let V = (K m ) m , i.e. V is the space of m-square matrices viewed as m-tuples of row vectors. Let 
D : V -» K. 

(i) Show that the following weaker statement is equivalent to D being alternating: 

D(A U A 2 , A„) = 0 

whenever Aj = A i+1 for some i. 

(ii) Suppose D is m-linear and alternating. Show that if A u A 2 ,...,A m are linearly dependent, 
then D(A U ...,A m ) = 0. 



2* - 3y 


= -1 


Ax + 7y 


= -1 " 


2z + 3 


= y + 3x 


x - 3z 


= 2y + l. 


Sy + z 


- 2 - 2x 
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8.66. Let V be the space of 2 by 2 matrices M = ^" over R. Determine whether or not 



D : V -* R is 2-linear (with respect to the rows) if (i) D(M) = ac-bd, (ii) D(Af) = ab - ed, 
(iii) D(Af) = 0, (iv) D(M) = 1. 

8.67. Let V be the space of M-square matrices over K. Suppose B e V is invertible and so det (B) 0. 
Define D:V-*K by D(A) = det (Afl)/det (B) where A G V. Hence 

D(A 1( A 2 , . . . , A n ) = det (A 1 B, A 2 B, AJ?)/det (B) 

where A ( is the tth row of A and so A f B is the ith row of AB. Show that D is multilinear and 
alternating, and that D(I) = 1. (Thus by Theorem 8.13, D(A) = det (A) and so det (AB) = 
det (A) det (B). This method is used by some texts to prove Theorem 8.5, i.e. \AB\ = |A| |B|.) 

MISCELLANEOUS PROBLEMS 

8.68. Let A be an w-square matrix. Prove \kA\ — k n |A|. 



8.69. Prove: 



1 *i x\ ... aj -1 
1 x 2 x\ ... xj" 1 



= (-1)" n (asi-*,) 
><l 



The above is called the Vandermonde determinant of order n. 

8.70. Consider the block matrix AT = ^ ^ where A and C are square matrices. Prove |Af| = |A| |C|. 

More generally, prove that if M is a triangular block matrix with square matrices A,, . . ., A m on 
the diagonal, then |M| = |A t | |A 2 | •• -|AJ. 

8.71. Let A, B, C and D be commuting n-square matrices. Consider the 2»-square block matrix 

M = (c a) • Prove that |M| = |A| |c| ~ |B| |cl - 

8.72. Suppose A is orthogonal, that is, A*A = I. Show that |A| = ±1. 

8.73. Consider a permutation <r = }J 2 . . . j n . Let {ej be the usual basis of X», and let A be the matrix 
whose ith row is ej., i.e. A = (e^, e 32 , . . ., e } J. Show that |A| = sgn a. 

8.74. Let A be an n-square matrix. The determinantal rank of A is the order of the largest submatrix 
of A (obtained by deleting rows and columns of A) whose determinant is not zero. Show that the 
determinantal rank of A is equal to its rank, i.e. the maximum number of linearly independent rows 
(or columns). 



Answers to Supplementary Problems 

8.41. (i) -18, (ii) -15. 

8.42. (i) «2 - St - 10, (ii) «2 - 2t - 8. 

8.43. (i) t = 5, t = -2; (ii) t = 4, t - -2. 

8.44. (i) 21, (ii) -11, (iii) 100, (iv) 0. 
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8.45. (i) (t + 2)(t - 3)(t - 4), (ii) (f + 2)2(t-4), (iii) (t + 2)2(t-4). 

8.46. (i) 3, 4, -2; (ii) 4, -2; (iii) 4, -2. 

8.47. (i) -131, (ii) -55. 

8.48. (i) -135, (ii) -103, (iii) -31. 



8.49. adj A = 



'-1 -1 1' 
-1 1 -1 
2-2 0, 



A-i = (adjA)/|A| = 



l-l 1 0, 



8.50. adj A — — 



1 0-2 
3-1 6 

2 1-5/ 



A-i = 



(-1 0 

3 1 - 
V-2 -1 5/ 



8.51. (i) 



8.52. A 



8.54. det(T) = 4. 



(-16 -29 -26 -2 

-30 -38 -16 29 

-8 51 -13 -1 

\-13 1 28 -18/ 

'k 0" 



(ii) 



/ 21 -14 -17 -19 \ 

-44 11 33 11 

-29 1 13 21 

\ 17 7 -19 -18/ 



8.55. (i) 0, (ii) 6, (iii) 1. 

8.58. (i) x = 21/26, y = 29/26; (ii) x = -5/13, y = 1/13. 

8.59. (i) * = 5, y = 1, 2 = 1. (ii) Since A = 0, the system cannot be solved by determinants. 

8.61. sgn <r = 1, sgn t = —1, sgn v = -1. 

8.62. (i) r»<r = 53142, (ii) w°o = 52413, (iii) <r-» = 32154, (iv) t' 1 = 14253. 
8.66. (i) Yes, (ii) No, (iii) Yes, (iv) No. 
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Eigenvalues and Eigenvectors 

INTRODUCTION 

In this chapter we investigate the theory of a single linear operator T on a vector 
space V of finite dimension. In particular, we find conditions under which T is diago- 
nalizable. As was seen in Chapter 7, this question is closely related to the theory of 
similarity transformations for matrices. 

We shall also associate certain polynomials with an operator T: its characteristic 
polynomial and its minimum polynomial. These polynomials and their roots play a major 
role in the investigation of T. We comment that the particular field K also plays an im- 
portant part in the theory since the existence of roots of a polynomial depends on K. 

POLYNOMIALS OF MATRICES AND LINEAR OPERATORS 

Consider a polynomial f(t) over a field K: f(t) = aj, n + • • • + ait + ao. If A is a square 
matrix over K, then we define 

/(A) = OnA n + ■ ■ • + diA + a 0 I 

where I is the identity matrix. In particular, we say that A is a root or zero of the poly- 
nomial f(t) if /(A) = 0. 

Example 9.1: Let A= (g > and let /(t) = 2*2 - 3i + 7, g{t) = P - 5« - 2. Then 

«*»-•(; Ihil 
- ■ G !) -CD- c:) 

Thus A is a zero of g(t). 
The following theorem applies. 

Theorem 9.1: Let / and g be polynomials over K, and let A be an w-square matrix over K. 
Then 

(i) (f + g)(A) = f(A) + g(A) 

(ii) (fg)(A) = f(A)g(A) 
and, for any scalar k G K, 

(iii) (kf){A) = kf{A) 

Furthermore, since f(t) g(t) = g(t) f(t) for any polynomials f{t) and g(t), 

f(A)g(A) = g(A)f(A) 
That is, any two polynomials in the matrix A commute. 
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Now suppose T : V -» V is a linear operator on a vector space V over K. If f(t) — 
Ont n + ■ • • + ait + ao, then we define f(T) in the same way as we did for matrices: 

f{T) = a n T« + ■■■ + aiT + a 0 I 

where / is now the identity mapping. We also say that T is a zero or root of f(t) if f{T) = 0. 
We remark that the relations in Theorem 9.1 hold for operators as they do for matrices; 
hence any two polynomials in T commute. 

Furthermore, if A is a matrix representation of T, then f(A) is the matrix representation 
of f{T). In particular, f{T) = 0 if and only if f(A) = 0. 



EIGENVALUES AND EIGENVECTORS 

Let T : V -* V be a linear operator on a vector space V over a field K. A scalar A G K 
is called an eigenvalue of T if there exists a nonzero vector v G V for which 

T(v) = \v 

Every vector satisfying this relation is then called an eigenvector of T belonging to the 
eigenvalue A.. Note that each scalar multiple kv is such an eigenvector: 

T(kv) - kT(v) - - k(\v) - \{kv) 

The set of all such vectors is a subspace of V (Problem 9.6) called the eigenspace of \. 

The terms characteristic value and characteristic vector (or: proper value and proper 
vector) are frequently used instead of eigenvalue and eigenvector. 

Example 9.2: Let I:V->V be the identity mapping. Then, for every »€V, I(v) = v = lv. 

Hence 1 is an eigenvalue of /, and every vector in V is an eigenvector belonging to 1. 



Example 9.3: Let T : R 2 -» R 2 be the linear operator which 
rotates each vector «£R ! by an angle 
$ = 90°. Note that no nonzero vector is a 
multiple of itself. Hence T has no eigen- 
values and so no eigenvectors. 

Example 9.4: Let D be the differential operator on the vector 
space V of differentiable functions. We have 
D(e st ) = 5e 5t . Hence 5 is an eigenvalue of D 
with eigenvector e 5t . 

If A is an w-square matrix over K, then an eigenvalue of A means an eigenvalue of A 
viewed as an operator on K n . That is, A G K is an eigenvalue of A if, for some nonzero 
(column) vector v G K n , 

K ' Av = \v 




In this case v is an eigenvector of A belonging to A. 



Example 9.5: Find eigenvalues and associated nonzero eigenvectors of the matrix A = ( 
We seek a scalar t and a nonzero vector X = I \ such that AX — tX: 

[I DO - h 

The above matrix equation is equivalent to the homogeneous system 

I - l)x - 2y = 0 
-2)2/ = 0 



(i 



f x + 2y = tx f (t - l)x - 

\3x + 2y = ty ° T \-Sx + (t - '. 



(1) 
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Recall that the homogeneous system has a nonzero solution if and only if the de- 
terminant of the matrix of coefficients is 0: 



t - 1 -2 
-3 t - 2 



= «2 - 3t - 4 = (* — 4)(t+ 1) = 0 



Thus t is an eigenvalue of A if and only if t = 4 or t = —1. 

Setting t = 4 in (i), 

3a; — 2t/ = 0 

„ or simply Sx — 2y — 0 

-3x + 2y = 0 

Thus « = = is a nonzero eigenvector belonging to the eigenvalue t = 4, 

and every other eigenvector belonging to t = 4 is a multiple of v. 
Setting t = -1 in (i), 

f-2* - 2j/ = 0 . ■ 

„ or simply x + 1/ = 0 

-3* - 3j/ = 0 

Thus 10 = ^*^=^ ^ is a nonzero eigenvector belonging to the eigenvalue 
t = —1, and every other eigenvector belonging to t = — 1 is a multiple of w. 

The next theorem gives an important characterization of eigenvalues which is fre- 
quently used as its definition. 

Theorem 9.2: Let T : V -* V be a linear operator on a vector space over K. Then XG.K 
is an eigenvalue of T if and only if the operator Ai — T is singular. The 
eigenspace of A. is then the kernel of A/ - T. 

Proof, x is an eigenvalue of T if and only if there exists a nonzero vector v such that 

T(v) = Xv or (Xl)(v)-T(v) = 0 or (Xl-T)(v) = 0 

i.e. XI — T is singular. We also have that v is in the eigenspace of A. if and only if the above 
relations hold; hence v is in the kernel of XI — T. 

We now sta f 3 a very useful theorem which we prove (Problem 9.14) by induction: 

Theorem 9.3: Nonzero eigenvectors belonging to distinct eigenvalues are linearly 
independent. 

Example 9.6: Consider the functions e" 1 ', e a%t , . . ., e"** where a u . . .,a n are distinct real numbers. 

If D is the differential operator then £>(e° kt ) = a k e akt . Accordingly, e"^, e""* 
are eigenvectors of D belonging to the distinct eigenvalues a u . . . , a n , and so, by 
Theorem 9.3, are linearly independent. 

We remark that independent eigenvectors can belong to the same eigenvalue (see 
Problem 9.7). 



DIAGONALIZATION AND EIGENVECTORS 

Let T : V -* V be a linear operator on a vector space V with finite dimension n. Note 
that T can be represented by a diagonal matrix 



/ki 


0 . 


. 0 


0 


kz . 


. . 0 


\o 


0 . 
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if and only if there exists a basis {v u . . .,v n ) of V for which 

T(vi) = kiVi 
T(v 2 ) - k 2 v 2 



T(v n ) — k„V n 



that is, such that the vectors v u . . . , v n are eigenvectors of T belonging respectively to eigen- 
values hi, k n . In other words: 

Theorem 9.4: A linear operator T : V -* V can be represented by a diagonal matrix B 
if and only if V has a basis consisting of eigenvectors of T. In this case 
the diagonal elements of B are the corresponding eigenvalues. 

We have the following equivalent statement. 

Alternate Form of Theorem 9.4: An n-square matrix A is similar to a diagonal matrix 

B if and only if A has n linearly independent eigen- 
vectors. In this case the diagonal elements of B are the 
corresponding eigenvalues. 

In the above theorem, if we let P be the matrix whose columns are the n independent 
eigenvectors of A, then B = P _1 AP. 

Example 9.7: Consider the matrix A — Q ^ . By Example 9.5, A has two independent 
eigenvectors Q and Set P = (J _J) , and so P~* = . 

Then A is similar to the diagonal matrix 

B - P ~ 1AP = ( 3 /5 -Wis -l) = (0 -l) 

As expected, the diagonal elements 4 and —1 of the diagonal matrix B are the eigen- 
values corresponding to the given eigenvectors. 

CHARACTERISTIC POLYNOMIAL, CAYLEY-HAMILTON THEOREM 

Consider an w-square matrix A over a field K: 



'■1 


fan 

d21 


012 . 
(I22 


. ain \ 

0,2n 




\a„i 


Ctn2 


Onn j 



The matrix tl n — A, where /„ is the w-square identity matrix and * is an indeterminant, is 
called the characteristic matrix of A: 

It — an — ai2 . . . — ain 

, T . — 021 t — 022 . . . — a2n 

tl n - A - 



~ttnl — a n 2 ... t~ ftnn 



Its determinant A A {t) — det (tl n — A) 

which is a polynomial in t, is called the characteristic polynomial of A. We also call 

A A (t) = det (th -A) = 0 

the characteristic equation of A. 
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Now each term in the determinant contains one and only one entry from each row and 
from each column; hence the above characteristic polynomial is of the form 

A A (t) = (t - dll)(t - dvt) • • -(t-Onn) 

+ terms with at most n — 2 factors of the form t — an 

Accordingly, 

A A (t) — t n — (an + a 2 2 + • • • + anr)t n ~ l + terms of lower degree 

Recall that the trace of A is the sum of its diagonal elements. Thus the characteristic 
polynomial A A (t) = det (tl n — A) of A is a monic polynomial of degree n, and the coefficient 
of is the negative of the trace of A. (A polynomial is monic if its leading coefficient is 1.) 

Furthermore, if we set t = 0 in A A (t), we obtain 

MO) = \-a\ = 

But A A (0) is the constant term of the polynomial A A (t). Thus the constant term of the char- 
acteristic polynomial of the matrix A is (—1)" \A\ where n is the order of A. 

l 3 

Example 9.8: The characteristic polynomial of the matrix A = I — 2 2 




A(i) = \tI-A\ = 



t - 1 -3 0 
2 t - 2 1 
-4 0 t + 2 



= <3 _ (2 + 2f + 28 



As expected, A(t) is a monic polynomial of degree 3. 

We now state one of the most important theorems in linear algebra. 
Cayley-Hamilton Theorem 9.5: Every matrix is a zero of its characteristic polynomial. 

/I 2\ 

Example 9.9 : The characteristic polynomial of the matrix A = I ^ ) is 



A(t) = \tI-A\ = 



t-l -2 
-3 t - 2 



= *2 - 3t - 4 



As expected from the Cayley-Hamilton theorem, A is a zero of A(t): 

= ly-a D-c :)-c. °) 

The next theorem shows the intimate relationship between characteristic polynomials 
and eigenvalues. 

Theorem 9.6: Let A be an n-square matrix over a field K. A scalar X G K is an eigen- 
value of A if and only if A is a root of the characteristic polynomial A(t) of A. 

Proof. By Theorem 9.2, A. is an eigenvalue of A if and only if XI — A is singular. 
Furthermore, by Theorem 8.4, XI — A is singular if and only if \xl — A\ — 0, i.e. A is a root 
of A(t). Thus the theorem is proved. 

Using Theorems 9.3, 9.4 and 9.6, we obtain 

Corollary 9.7: If the characteristic polynomial A(t) of an M-square matrix A is a product 
of distinct linear factors: 
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A(t) = (t-ai)(t-a 2 )- • - (t-a n ) 

i.e. if ai, . . . , On are distinct roots of A(t), then A is similar to a diagonal 
matrix whose diagonal elements are the <k. 

Furthermore, using the Fundamental Theorem of Algebra (every polynomial over C 
has a root) and the above theorem, we obtain 

Corollary 9.8: Let A be an n-square matrix over the complex field C. Then A has at 
least one eigenvalue. 



Example 9.10: Let A = 



0 
2 
1 

A(t) = 



We consider two cases: 



Its characteristic polynomial is 



t - 3 0 0 
0 t - 2 5 
0 -1 t + 2 



= (t-3)(f + l) 



(i) A is a matrix over the real field R. Then A has only the one eigenvalue 3. 
Since 3 has only one independent eigenvector, A is not diagonalizable. 

(ii) A is a matrix over the complex field C. Then A has three distinct eigenvalues: 
3, i and — i. Thus there exists an invertible matrix P over the complex field C 
for which 

^3 0 0 N 

P-iAP = 



i.e. A is diagonalizable. 



Now suppose A and B are similar matrices, say B = P~ 1 AP where P is invertible. We 
show that A and B have the same characteristic polynomial. Using tl = P~ 1 tIP, 

\tI-B[ = \tI-P~ l AP\ = \P-HIP -P~ l AP\ 

= \P- 1 (tI-A)P\ - IP -1 ! \tl - A\ \P\ 

Since determinants are scalars and commute, and since |P -1 | \P\ = 1, we finally obtain 

\tI-B\ = \tI-A\ 

Thus we have proved 

Theorem 9.9: Similar matrices have the same characteristic polynomial. 



MINIMUM POLYNOMIAL 

Let A be an ^-square matrix over a field K. Observe that there are nonzero polynomials 
/(£) for which f{A) — 0; for example, the characteristic polynomial of A. Among these 
polynomials we consider those of lowest degree and from them we select one whose leading 
coefficient is 1, i.e. which is monic. Such a polynomial m(t) exists and is unique (Problem 
9.25); we call it the minimum polynomial of A. 

Theorem 9.10: The minimum polynomial m(t) of A divides every polynomial which has A 
as a zero. In particular, m(t) divides the characteristic polynomial A(t) of A. 

There is an even stronger relationship between m(t) and A(t). 
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Theorem 9.11: The characteristic and minimum polynomials of a matrix A have the same 
irreducible factors. 

This theorem does not say that m(t) = A(t); only that any irreducible factor of one must 
divide the other. In particular, since a linear factor is irreducible, m(t) and A(t) have the 
same linear factors; hence they have the same roots. Thus from Theorem 9.6 we obtain 

Theorem 9.12: A scalar A is an eigenvalue for a matrix A if and only if A is a root of the 
minimum polynomial of A. 



Example 9.11: Find the minimum polynomial m(t) of the matrix A = 



The characteristic polynomial of A is A(t) = \tl — A\ = (t-2) s (t-5). By 
Theorem 9.11, both t — 2 and t — 5 must be factors of m(t). But by Theorem 9.10, 
m(t) must divide A(t); hence m(t) must be one of the following three polynomials: 

mi (t) = (*-2)(t-5), tn^t) = (t-2)2(t-5), m 3 (t) = (t- 2)8(4-5) 

We know from the Cayley-Hamilton theorem that m 3 (A) = A(A) = 0. The reader can 
verify that m^A) # 0 but m 2 (A) - 0. Accordingly, m^t) = (t-2) 2 (t — 5) is the 
minimum polynomial of A. 




Example 9.12: Let A be a 3 by 3 matrix over the real field R. We show that A cannot be a zero 
of the polynomial /(*) — t 2 + 1. By the Cayley-Hamilton theorem, A is a zero of 
its characteristic polynomial A(t). Note that A(t) is of degree 3; hence it has at least 
one real root. 

Now suppose A is a zero of f(t). Since f(t) is irreducible over R, f(t) must be 
the minimal polynomial of A. But f(t) has no real root. This contradicts the fact 
that the characteristic and minimal polynomials have the same roots. Thus A is not 
a zero of f(t). 

The reader can verify that the following 3 by 3 matrix over the complex 
field C is a zero of f(t): 




CHARACTERISTIC AND MINIMUM POLYNOMIALS OF 
LINEAR OPERATORS 

Now suppose T : V -» V is a linear operator on a vector space V with finite dimension. 
We define the characteristic polynomial A(i) of T to be the characteristic polynomial of any 
matrix representation of T. By Theorem 9.9, A(t) is independent of the particular basis in 
which the matrix representation is computed. Note that the degree of A(t) is equal to the 
dimension of V. We have theorems for T which are similar to the ones we had for matrices: 

Theorem 9.5': T is a zero of its characteristic polynomial. 

Theorem 9.6': The scalar AGK is an eigenvalue of T if and only if A is a root of the 
characteristic polynomial of T. 

The algebraic multiplicity of an eigenvalue A G K of T is defined to be the multiplicity 
of A as a root of the characteristic polynomial of T. The geometric multiplicity of the 
eigenvalue A is defined to be the dimension of its eigenspace. 

Theorem 9.13: The geometric multiplicity of an eigenvalue A does not exceed its algebraic 
multiplicity. 
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Example 9.13: Let V be the vector space of functions which has {sin 6, cos 9} as a basis, and let D 
be the differential operator on V. Then 



D(sin e) = cos 8 - 
D(cos 6) = — sin e 



0(sin e) + l(cos e) 
- -l(sine) + O(cose) 



The matrix A of D in the above basis is therefore A = [D] = ^ J ^ 

det(</-A) = 1 _1 = 
It 

and the characteristic polynomial of D is A(i) = i 2 + l. 



Thus 



+ 1 



On the other hand, the minimum polynomial m(t) of the operator T is defined independ- 
ently of the theory of matrices, as the polynomial of lowest degree and leading coefficient 1 
which has T as a zero. However, for any polynomial f(t), 

f(T) = 0 if and only if /(A) = 0 

where A is any matrix representation of T. Accordingly, T and A have the same minimum 
polynomial. We remark that all the theorems in this chapter on the minimum polynomial 
of a matrix also hold for the minimum polynomial of the operator T. 



Solved Problems 

POLYNOMIALS OF MATRICES AND LINEAR OPERATORS 
9.1. Find f(A) where A = ^ and f(t) = t 2 - 3t + 7. 



9.2. Show that A = ( * J ) is a zero of f(t) = t 2 - 4t - 5. 



/(A) = A 2 -4A-5/ = (I J)' -4 (J })-,( 



1 0\ _ /0 0 
0 1/ ~ U 0 



9.3. Let V be the vector space of functions which has {sin B, cos 0} as a basis, and let 
D be the differential operator on V. Show that D is a zero of /(£) = t 2 + 1. 

Apply /(£>) to each basis vector: 

/(D)(sin e) = (D 2 + 7)(sin e) = P 2 (sin 9) + /(sin (?) = -sin s + sin e = 0 
/(D)(cos ») = (I> 2 + /)(cos 9) = D 2 (cos e) + /(cos s) = -cos e + cos e = 0 

Since each basis vector is mapped into 0, every vector v S V is also mapped into 0 by f(D). Thus 
/(£>) = 0. 

This result is expected since, by Example 9.13, f(t) is the characteristic polynomial of D. 
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9.4. Let A be a matrix representation of an operator T. Show that /(A) is the matrix 
representation of f(T), for any polynomial f(t). 

Let <(> be the mapping T h* A, i.e. which sends the operator T into its matrix representation A. 
We need to prove that <f>(f(T)) - f(A). Suppose f(t) = a n t n + • ■ • + a x t + a 0 . The proof is by in- 
duction on n, the degree of /(t). 

Suppose n = 0. Recall that </>(I') = / where /' is the identity mapping and / is the identity 
matrix. Thus 

mT)) = *(««/') = avt>(I') = a 0 I = f(A) 

and so the theorem holds for n — 0. 

Now assume the theorem holds for polynomials of degree less than n. Then since <f> is an 
algebra isomorphism, 

*<J(T)) = + <»„_! r»-i + ■■■ + ai T + V) 

= a n <f>(T) + ^K^T"-! + • • • + a r T + a,,/') 

= Bn AA«-i + (a^A"-! + • • • + ai A + ao I) = f(A) 

and the theorem is proved. 



9.5. Prove Theorem 9.1: Let / and g be polynomials over K. Let A be a square matrix 
over K. Then: (i) (/ + g)(A) = /(A) + flr(A); (ii) (/flf)(A) = /(A) sr(A); and (iii) (kf)(A) = 
kf(A) where k G K. 

Suppose / = a n t n H 1- o t t + a 0 and g = b m t m H 1- 6^ + 6 0 . Then by definition, 

/(A) = a n A» + • • • + a t A + aj and g(A) = 6 m A™ + • • • + 6,A + b 0 I 

(i) Suppose m — n and let 6 S = 0 if i > m. Then 

/ + = (a B +6„)*"+ ••• + (a 1 + 6 1 )t + (a 0 +&o) 

Hence 

(/ + flO(A) = K + &n)4 n +•••+(«!+ 6l)A + K + &„)/ 

= a„A» + &„A« + • • • + a x A + &jA + + &„/ = /(A) + ff(A) 

n+m 

(ii) By definition, fg = c n+m t n+m + • • • + c x t + c 0 = 2 c k t k where c k = a 0 b k + a 1 b k - 1 + 

k = 0 

k n+m 

••• + Ofc&o = 2 Oi&k-i- Hence (fg)(A) = 2 c^A* an d 

i=0 k=0 

(n \ / m \ n m n + m 

2 OiAMf 2MM = 2 2<*M i+i = 2 c k A« = (f g )(A) 

(iii) By definition, kf = fco n t" + • • • + fta t * + ka 0 , and so 

(fc/)(A) = ka n A* + •■• + ka x A + ka 0 I = ft(a„A» + • • • + a t A + Oq/) = fc /(A) 



EIGENVALUES AND EIGENVECTORS 

9.6. Let A. be an eigenvalue of an operator T : V -*■ V. Let Vx denote the set of all eigen- 
vectors of T belonging to the eigenvalue A. (called the eigenspace of A.). Show that 
Fx is a subspace of V. 

Suppose v, w G V x ; that is, T(v) = \v and T{w) = Xw. Then for any scalars a,b & K, 

T(av + bw) = a T(v) + 6 T(w) - a(\v) + b{\w) = \(av + bw) 

Thus <m> + bw is an eigenvector belonging to X, i.e. au + bw G V x . Hence V x is a subspace of V. 
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9.7. Let A = f g J . (i) Find all eigenvalues of A and the corresponding eigenvectors, 
(ii) Find an invertible matrix P such that P^AP is diagonal, 
(i) Form the characteristic matrix tl — A of A: 

-*-(::)-(i:) = ('-.-.) 

The characteristic polynomial A(t) of A is its determinant: 



A(«) = |t/-A| = 



t- 1 -4 
-2 t - 3 



The roots of A(t) are 5 and —1, and so these numbers are the eigenvalues of A. 

We obtain the eigenvectors belonging to the eigenvalue 5. First substitute t = 5 into 
the characteristic matrix (1) to obtain the matrix ( 4 4 ). The eigenvectors belonging to 



,-2 2, 

5 form the solution of the homogeneous system determined by the above matrix, i.e., 



/ 4 -4\/zN /0\ J 4x-4y = 0 

V-2 2j\y) ~ \0j or \-2x + 2y = 0 



* - y = 0 



(In other words, the eigenvectors belonging to 5 form the kernel of the operator tl - A for 
t = 5.) The above system has only one independent solution; for example, x = 1, y = 1. Thus 
« = (1> 1) is an eigenvector which generates the eigenspace of 5, i.e. every eigenvector belong- 
ing to 5 is a multiple of v. 

We obtain the eigenvectors belonging to the eigenvalue —1. Substitute t = — 1 into (1) 
to obtain the homogeneous system 



-2 -4)(i/) (0) 



\—2« - Ay - 0 

- |_ 2a; _ % = o or * + 2 f = ° 



The system has only one independent solution; for example, x = 2, y = —1. Thus w = (2, —1) 
is an eigenvector which generates the eigenspace of —1. 

(ii) Let P be the matrix whose columns are the above eigenvectors: P = ( ) . Then 



1 -1, 

.B = P~ 1 AP is the diagonal matrix whose diagonal entries are the respective eigenvalues: 

0 s 



B = P-iAP = 



/1/3 2/3X/1 4N/1 2 \ _ /5 0 
Vl/3 -1/3/(2 3 X 1 - 1 / ~ V» -1 



{Remark. Here P is the transition matrix from the usual basis of R 2 to the basis of eigen- 
vectors {v, w}. Hence B is the matrix representation of the operator A in this new basis.) 



9.8. For each matrix, find all eigenvalues and a basis of each eigenspace: 

-1 





ll 


-3 


3^ 




1-3 


1 


A = 


3 


-5 


3 


(ii) B = 


-7 


5 




\6 


-6 


4J 




I" 6 


6 



Which matrix can be diagonalized, and why? 



(i) Form the characteristic matrix tl — A and compute its determinant to obtain the character- 
istic polynomial A(t) of A: 

' t - 1 3 -3 
-3 t + 5 -3 
-6 6 t - 4 



A(t) = \tl-A\ = 



(t + 2)»(f-4) 



The roots of A(t) are —2 and 4; hence these numbers are the eigenvalues of A. 
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We find a basis of the eigenspace of the eigenvalue —2. Substitute t = — 2 into the char- 
acteristic matrix tl — A to obtain the homogeneous system 




-3x + 3y - 3z = 0 
-3x + 3y - 3z = 0 
-6x + 6y - 6z = 0 



y + z = 0 



The system has two independent solutions, e.g. x = 1, y = 1, z = 0 and x = 1, y = 0, z = — 1. 
Thus it = (1, 1, 0) and v = (1, 0, —1) are independent eigenvectors which generate the eigen- 
space of —2. That is, u and v form a basis of the eigenspace of —2. This means that every 
eigenvector belonging to —2 is a linear combination of u and v. 

We find a basis of the eigenspace of the eigenvalue 4. Substitute t = 4 into the char- 
acteristic matrix tl — A to obtain the homogeneous system 

















- 1 












\x + y — z = 0 
\ 2y - z = 0 



The system has only one free variable; hence any particular nonzero solution, e.g. x = 1, y = 1, 
z = 2, generates its solution space. Thus w = (1, 1, 2) is an eigenvector which generates, and 
so forms a basis, of the eigenspace of 4. 

Since A has three linearly independent eigenvectors, A is diagonalizable. In fact, let P 
be the matrix whose columns are the three independent eigenvectors: 



P = 



As expected, the diagonal elements of P~ l AP are the eigenvalues of A corresponding to the 
columns of P. 






1-2 


0 


0 


Then P'^AP = 


0 


-2 


0 




\ o 


0 





(ii) A(f) = \a-B\ - 

The eigenvalues of B are therefore —2 and 4. 



t + 3 -1 
7 t - 5 
6 -6 



1 
1 

t + 2 



= (t + 2)2(t-4) 



We find a basis of the eigenspace of the eigenvalue —2. Substitute t = — 2 into tl — B 
to obtain the homogeneous system 



-1 


l\ 






/o 


-7 


1 




i-i 


0 


-6 


0/ 









a; — y + z = 0 
7a; - 7y + z = 0 
.Bx - 6y - 0 



2/ + z = 0 
y =0 



The system has only one independent solution, e.g. x 
forms a basis of the eigenspace of —2. 



1, y = 1, z = 0. Thus m = (1, 1, 0) 



We find a basis of the eigenspace of the eigenvalue 4. Substitute t — 4 into <7 —B to 
obtain the homogeneous system 








fo 


y 


\- 


0 









y + z = 0 
2/ + z = 0 
63/ + 6z = 0 



; 7X — 



+ z = 0 
a; = 0 



The system has only one independent solution, e.g. x 
forms a basis of the eigenspace of 4. 



0, y = 1, z - 1. Thus -v = (0, 1, 1) 



Observe that B is not similar to a diagonal matrix since B has only two independent 
eigenvectors. Furthermore, since A can be diagonalized but B cannot, A and B are not similar 
matrices, even though they have the same characteristic polynomial. 
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9.9. Let A = 



(i) 



A A (t) = \tI-A\ = 



Find all eigenvalues and the corresponding 



.; ~\) - - - (; 

eigenvectors of A and Z? viewed as matrices over (i) the real field R, (ii) the complex 
field C. 

' t - 3 l 

-l t - 1 



Hence only 2 is an eigenvalue. Put t 



(:: X) ■ C) 



+ 4 = (t-2) 2 
2 into i/ — A and obtain the homogeneous system 

x — y = 0 



or 



— x + y = 0 
—x + y = 0 



The system has only one independent solution, e.g. x = 1, y = 1. Thus v = (1, 1) is an eigen- 
vector which generates the eigenspace of 2, i.e. every eigenvector belonging to 2 is a multiple 
of v. 

We also have 



A B (t) = |t/-/J| 



t - 1 
-2 



1 

t + 1 



t 2 + 1 



Since « 2 + 1 has no solution in R, Z? has no eigenvalue as a matrix over R. 

(ii) Since A A (t) = (t — 2) 2 has only the real root 2, the results are the same as in (i). That is, 
2 is an eigenvalue of A, and v = (1,1) is an eigenvector which generates the eigenspace of 2, 
i.e. every eigenvector of 2 is a (complex) multiple of v. 

The characteristic matrix of B is A a (t) = \tl — B\ = t 2 + 1. Hence i and — i are the eigen- 
values of B. 

We find the eigenvectors associated with t = i. Substitute t = i in tl — B to obtain the 
homogeneous system 



i - 1 
-2 



1 

i + 1 



■C) » ( 



(i - l)x + y = 0 
■2x + (i + l)y = 0 



The system has only one independent solution, e.g. x — 1, y — 1 • 
an eigenvector which generates the eigenspace of i. 



(i -\)x + y = 0 
Thus w = (1, 1 — i) is 



Now substitute t = —i into tl — B to obtain the homogeneous system 



(— i— 1)* + y = 0 
-2a; + (—i -l)y - 0 



or 



(—i - l)x + y = 0 



The system has only one independent solution, e.g. x = 1, 3/ = 1 + i. Thus w' = (1,1 + 1) is 
an eigenvector which generates the eigenspace of — i. 



9.10. Find all eigenvalues and a basis of each eigenspace of the operator T : R 3 -* R 3 defined 
by T(x, y, z) = (2x + y,y- z, 2y + 4z). 

First find a matrix representation of T, say relative to the usual basis of R 3 : 

1% 1 0^ 

A = [T\ = 




The characteristic polynomial A(£) of 2" is then 

t-2 -1 0 

A(t) = \tI-A\ = 0 t-1 1 

0 -2 t - 4 



= (*-2) 2 (t-3) 



Thus 2 and 3 are the eigenvalues of T. 

We find a basis of the eigenspace of the eigenvalue 2. Substitute t = 2 into t/ — A to obtain 
the homogeneous system 

/„\ /«\ f - n 

j/ =0 
y + z = 0 
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The system has only one independent solution, e.g. x = 1, y = 0, z = 0. Thus u = (1, 0, 0) forms 
a basis of the eigenspace of 2. 

We find a basis of the eigenspace of the eigenvalue 3. Substitute t = 3 into tl — A to obtain 
the homogeneous system 

(0 2 l]Lj=(o] or \ 2y + z = 0 or { X ~ V = ° 

The system has only one independent solution, e.g. x — 1, y — 1, a = —2. Thus v = (1, 1, -2) 
forms a basis of the eigenspace of 3. 

Observe that T is not diagonalizable, since T has only two linearly independent eigenvectors. 



9.11. Show that 0 is an eigenvalue of T if and only if T is singular. 

We have that 0 is an eigenvalue of T if and only if there exists a nonzero vector v such that 
T(v) = Ov - 0, i.e. that T is singular. 



9.12. Let A and B be n-square matrices. Show that AB and Z?A have the same eigenvalues. 

By Problem 9.11 and the fact that the product of nonsingular matrices is nonsingular, the fol- 
lowing statements are equivalent: (i) 0 is an eigenvalue of AB, (ii) AB is singular, (iii) A or B is 
singular, (iv) BA is singular, (v) 0 is an eigenvalue of BA. 

Now suppose X is a nonzero eigenvalue of AB. Then there exists a nonzero vector v such that 
ABv = \v. Set w — Bv. Since X ¥> 0 and v # 0, 

Aw = ABv = \v 0 and so w 0 

But w is an eigenvector of BA belonging to the eigenvalue X since 

BAw = BABv = BXv = XBv = \w 

Hence X is an eigenvalue of BA. Similarly, any nonzero eigenvalue of BA is also an eigenvalue 
of AB. 

Thus AB and BA have the same eigenvalues. 



9.13. Suppose A is an eigenvalue of an invertible operator T. Show that A. -1 is an eigenvalue 
of T-K 

Since T is invertible, it is also nonsingular; hence by Problem 9.11, X ¥• 0. 

By definition of an eigenvalue, there exists a nonzero vector v for which T(v) — \v. Apply- 
ing 7 7-1 to both sides, we obtain v = T~ l {Kv) = \ T~ l (v). Hence T- 1 ^) = X" 1 !;; that is, X -1 
is an eigenvalue of 



9.14. Prove Theorem 9.3: Let vi, . . ., v n be nonzero eigenvectors of an operator T : V -* V 
belonging to distinct eigenvalues Ai, . . . , K. Then Vi, . . . , v„ are linearly independent. 

The proof is by induction on n. If n = 1, then v t is linearly independent since v t ¥* 0. 
Assume n > 1. Suppose 

ajV, + o 2 i; 2 + • • • + a n v„ = 0 (7) 
where the o f are scalars. Applying T to the above relation, we obtain by linearity 
a 1 7'(i; 1 ) + a 2 T(v 2 ) + ■■■ + a n T{v n ) = T(0) = 0 

But by hypothesis T(v t ) = X^; hence 

ctiX^! + asjXjjVjs + • • • + a„X„v B = 0 (2) 
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On the other hand, multiplying (1) by X„, 

o-iKVi + «2V2 + • • • + a„\if „ = 0 ($) 

Now subtracting (3) from (2), 

<*i(Xi - K) v i + <h.(^2 — \d v 2 + " • • + a n -AK-i - K) v n-i = 0 

By induction, each of the above coefficients is 0. Since the \ { are distinct, X 4 — X„ ¥= 0 for i ¥> n. 
Hence d = • • • = = 0. Substituting this into (1) we get a n v n = 0, and hence a n = 0. Thus 
the v t are linearly independent. 



CHARACTERISTIC POLYNOMIAL, CAYLEY-HAMILTON THEOREM 
9.15. Consider a triangular matrix 



A = 



Ian a 12 
0 a 22 



0 0 ... a™ y 

Find its characteristic polynomial A(t) and its eigenvalues. 

Since A is triangular and tl is diagonal, tl - A is also triangular with diagonal elements t — a H : 

(t - a u —a 12 ■ ■ ■ -Oin \ 
0 t — a 22 ... -a 2n 
0 0 ... t-a nn j 

Then A(t) = \tl — A\ is the product of the diagonal elements t — a u : 

A(t) = (t-a n )(t-a 22 )- ••(«-»„„) 
Hence the eigenvalues of A are a n , 022, • • • , «*,«> i-e. its diagonal elements 



9.16. Let A = 



1 2 8\ 

0 2 3 . Is A similar to a diagonal matrix? If so, find one such matrix. 

,0 0 3/ 

Since A is triangular, the eigenvalues of A are the diagonal elements 1, 2 and 3. Since they 
are distinct, A is similar to a diagonal matrix whose diagonal elements are 1, 2 and 3; for example, 

'l 0 
0 2 0 
i0 0 3, 



9.17. For each matrix find a polynomial having the matrix as a root: 



2 5 



2 -3 





!l 4 


-3\ 


(iii) c = 


0 3 


1 




\0 2 


-1/ 



By the Cayley-Hamilton theorem every matrix is a root of its characteristic polynomial. 
Therefore we find the characteristic polynomial A(t) in each case. 

(i) A(t) = \a-A\ 

(ii) A(t) - \tI-B\ 



(iii) A(t) = \a-c\ = 



t - 2 
-1 


-5 
t + 3 


= t 2 + t - 11 


* - 2 
-7 


3 

t + 4 


= t 2 


+ 2t + 13 


t - 1 


-4 


3 




0 


t - 3 


-1 


- («-l)(* 2 


0 


-2 


t + 1 
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9.18. Prove the Cayley-Hamilton Theorem 9.5: Every matrix is a zero of its characteristic 
polynomial. 

Let A be an arbitrary w-square matrix and let A(t) be its characteristic polynomial; say, 

A(t) = \tI-A\ = *» + a„_it"- 1 + •■• H-Ojt + Oo 

Now let Bit) denote the classical adjoint of the matrix tl — A. The elements of B(t) are cof actors 
of the matrix tl — A and hence are polynomials in t of degree not exceeding n — 1. Thus 

B(t) = £„_!*»-! + • • • + Bib + B 0 

where the B { are re-square matrices over K which are independent of t. By the fundamental property 
of the classical adjoint (Theorem 8.8), 

(tI-A)B(t) = \tI-A\I 
or (i/-A)(fl„_it»-i+ ••• + B 1 t + B 0 ) = (<» + o B _ 1 t»- 1 + ••• +a 1 t + <t 0 )7 

Removing parentheses and equating the coefficients of corresponding powers of t, 



*„-! = 


I 






a„_ 




B n-3 ~ AB n-2 = 


a„_ 




Ba-ABi = 


a x I 




-AB 0 = 


a 0 I 





Multiplying the above matrix equations by A n , A" -1 , ...,A,I respectively, 

A»B B _! = A n 
A"-iB„_ 2 - A»B n _! = On-iA"-! 
A«-2B n _3 - A«-i£ n _ 2 = o n _ 2 A— 2 

AB 0 — A*Bi - a t A 
—AB 0 = a Q I 

Adding the above matrix equations, 

0 = A" + o„_ 1 A»-i + • • • + e^A + a„J 
In other words, A(A) = 0. That is, A is a zero of its characteristic polynomial. 



9.19. Show that a matrix A and its transpose A* have the same characteristic polynomial. 

By the transpose operation, (tJ — A)' = tl* — A* = tl — A*. Since a matrix and its transpose 
have the same determinant, \tl — A\ — \(tl — A)*| = \tl — A*|. Hence A and A* have the same char- 
acteristic polynomial. 



(At B\ 

9.20. Suppose M = I Q ^ 1 where Ai and A 2 are square matrices. Show that the char- 
acteristic polynomial of M is the product of the characteristic polynomials of Ai and 
Ai. Generalize. 



, , ftl-A, —B \ 

tl-M = I Q a — A/' Hence by P"* 1 *™ 8 - 70 » \ti-M\ - 
\tl — A\ \tl — B\, as required. 

By induction, the characteristic polynomial of the triangular block matrix 



tl - A-l -B 
0 tl — A 
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M 



Ax 
0 



B 
A 2 



D 



0 0 ... Aj 

where the A 4 are square matrices, is the product of the characteristic polynomials of the A t . 



MINIMUM POLYNOMIAL 

9.21. Find the minimum polynomial m(t) of A = 
The characteristic polynomial of A is 



/ 2 

' 0 



1 
2 

0 0 



0 
0 

1 

-2 



°\ 

0 

1 



A(t) = 



t-2 -1 0 0 

0 t-2 0 0 

0 0 t - 1 -1 

0 0 2 t-4 



t-2 -1 
0 t-2 



t - 



1 -1 
t - 4 



= (t-3)(t-2) 3 



The minimum polynomial m(t) must divide A(«). Also, each irreducible factor of A(t), i.e. t — 2 
and t — 3, must be a factor of m(t). Thus m(i) is exactly one of the following: 



We have 



f{A) = (A-SI)(A-2I) = 



= (t- 


- 3)(t - 2)2 




Ht) 




(f- 




1 0 


°\ 


1° 


1 


0 




-1 0 


0 


0 


0 


0 


("i 


0 -2 


1 


0 


0 


-1 




0 -2 


1/ 


\o 


0 


-2 


/- 1 


1 0 




/o 


1 


0 


0 


-1 0 




0 


0 


0 


0 


0 -2 


;) 


0 


0 


-1 


\ o 


0 -2 




\o 


0 


-2 



^ 0 



ff (A) = (A — 3/) (A — 2Z) 2 = 1=0 



Thus fl'(t) = (t — 3)(t — 2) 2 is the minimum polynomial of A. 

Remark. We know that h(A) — \(A) — 0 by the Cayley-Hamilton theorem. However, the degre 
of g(t) is less than the degree of h(t); hence g(t), and not h(t), is the minimum polynomial of A. 




9.22. Find the minimal polynomial m(t) of each matrix (where a 0): 

fx. a 0 0\ 

f\ a 0\ 

(i)A =( J ^, (ii)S = (o A a), (iii) C = 

(i) The characteristic polynomial of A is A(t) = (t — X) 2 . We find A — X/ # 0; hence m(t) = 
A(t) = (t - X) 2 . 

(ii) The characteristic polynomial of B is A(i) = (t — X) 3 . (Note m(t) is exactly one of t — X, (t — X) 2 
or (t - X) 3 .) We find (B - X/) 2 ^ 0; thus m(t) = A(t) = (t - X) s . 

(iii) The characteristic polynomial of C is A(t) = (t-X) 4 . We find (C-X/) 3 #0; hence m(t) = 
A(t) = (t-X)4. 
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(A 0\ 

9.23. Let M = I Q g j where A and B are square matrices. Show that the minimum 

polynomial m(f) of M is the least common multiple of the minimum polynomials g(t) 
and h(t) of A and B respectively. Generalize. 

/m(A) 0 \ 

Since m(t) is the minimum polynomial of M, m(M) = I Q m ^ J = 0 and hence m(A) = 0 

and m(B) = 0. Since g{t) is the minimum polynomial of A, g(t) divides m(t). Similarly, h(t) 
divides m(t). Thus m(t) is a multiple of g(t) and 7i(t). 

//(A) 0\ /0 0\ 
Now let /(*) be another multiple of g(t) and h(t); then /(AT) = I Q /(B)/ = I 0 0/ = °" 

But »i(t) is the minimum polynomial of M; hence m{t) divides f(t). Thus m(t) is the least common 
multiple of g(t) and h(t). 

We then have, by induction, that the minimum polynomial of 

! 0 ... 0 

M = | y) A 2 ... 0 



0 0 ... i4 

where the A { are square matrices, is the least common multiple of the minimum polynomials of 
the A,. 



9.24. Find the minimum polynomial m(t) of 



M = 



2 


8 


0 


0 


0 


0 


0 


0 


2 


0 


0 


0 


0 


0 


0 


0 


4 


2 


0 


0 


0 


0 


0 


1 


3 


0 


0 


0 


0 


0 


0 


0 


0 


3 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


5 



Let A = 



^, B = ^ j g^, C=^* ^o) ' ^ = '^' le mm ' mum polynomials of 



A, C and U are (t — 2) 2 , t 2 and t — 5 respectively. The characteristic polynomial of 2? is 



\tI-B\ = 



t-4 -2 1 
-1 t - 3 



- 7t + 10 = (t-2)(t-5) 



and so it is also the minimum polynomial of B. 



Observe that M — 




Thus m(t) is the least common multiple of the minimum 



polynomials of A, B, C and D. Accordingly, m(t) = t 2 (t — 2) 2 (t — 5). 



9.25. Show that the minimum polynomial of a matrix (operator) A exists and is unique. 

By the Cayley-Hamilton theorem, A is a zero of some nonzero polynomial (see also Problem 9.31). 
Let n be the lowest degree for which a polynomial /(*) exists such that /(A) = 0. Dividing f(t) by 
its leading coefficient, we obtain a monic polynomial m(t) of degree n which has A as a zero. Sup- 
pose m'(t) is another monic polynomial of degree n for which m'(A) = 0. Then the difference 
m(t) — m'(t) is a nonzero polynomial of degree less than n which has A as a zero. This contradicts 
the original assumption on n; hence m(t) is a unique minimum polynomial. 
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9.26. Prove Theorem 9.10: The minimum polynomial m(t) of a matrix (operator) A 
divides every polynomial which has A as a zero. In particular, m(t) divides the char- 
acteristic polynomial of A. 

Suppose /(f) is a polynomial for which /(A) = 0. By the division algorithm there exist poly- 
nomials q(t) and r(f) for which f(t) = m(t) q(t) + r(f) and r(t) = 0 or deg r(t) < deg m(t). Sub- 
stituting t = A in this equation, and using that f(A) = 0 and m(A) = 0, we obtain r(A) = 0. If 
r(f) # 0, then r(f) is a polynomial of degree less than ra(t) which has A as a zero; this contradicts 
the definition of the minimum polynomial. Thus r(f) = 0 and so /(f) = m(f) q(t), i.e. m(t) divides f(t). 

9.27. Let m(t) be the minimum polynomial of an %-square matrix A. Show that the char- 
acteristic polynomial of A divides (m(t)) n . 

Suppose m(t) = f + c 1 t r ~ 1 + • • • + c r _i< + c r . Consider the following matrices: 

Bo = 1 
B x = A + ej 
B. 2 = A2 + e x A + c 2 I 

B r _! = A'-i + CjA'-a + • ■ ■ + c r -J 

Then B 0 = 7 

B t - AB 0 = c x 7 
B 2 -AB l = c 2 I 

Br- 1 ~ AB r _ 2 = C r -1^ 
Also, -AJ5 r _! = cj - (Ar + Cl Ar-i+ ••• +c r _ 1 A + c r /) 

= cj — m(A) 
— c,J 

Set B(t) = f- 1 B 0 + f-2B, + ••• + tF r _ 2 + B r _! 

Then 

(f/-A)-J3(f) = (fB 0 +f r - 1 Bi + ■•■+«,_!) - (f-iA£ 0 +t'- 2 AB 1 + •••+AB r _ 1 ) 

= FB 0 + f- 1 (Bi--AB 0 ) + * r_2 (-B 2 — AB X ) + ••• + t(B r _ 1 — AB r _ 2 ) ~ AB r -i 
= Pi + cjf- 1 / + c 2 f- 2 / + • • • + c r _i*7 + c,J 
= m(«)Z 

The determinant of both sides gives \tl — A\ \B(t)\ = \m(t) I\ = (w(t))». Since |£(f)| is a polynomial, 
\tI — A\ divides (m(f)) n ; that is, the characteristic polynomial of A divides (m(t)) n . 

9.28. Prove Theorem 9.11: The characteristic polynomial A(t) and the minimum poly- 
nomial m(t) of a matrix A have the same irreducible factors. 

Suppose /(f) is an irreducible polynomial. If /(f) divides m(f) then, since m(t) divides A(f), /(f) 
divides A(t). On the other hand, if /(f) divides A(f) then, by the preceding problem, /(f) divides 
(m(t)) n . But /(t) is irreducible; hence /(*) also divides m(t). Thus m(t) and A(f) have the same 
irreducible factors. 

9.29. Let T be a linear operator on a vector space V of finite dimension. Show that T is 
invertible if and only if the constant term of the minimal (characteristic) polynomial 
of T is not zero. 

Suppose the minimal (characteristic) polynomial of T is /(f) = f r + a n _ 1 t r_ i + • • • + a^t + a 0 . 
Each of the following statements is equivalent to the succeeding one by preceding results: (i) T is 
invertible; (ii) T is nonsingular; (iii) 0 is not an eigenvalue of T; (iv) 0 is not a root of m(f); (v) the 
constant term a 0 is not zero. Thus the theorem is proved. 
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9.30. Suppose dimV = n. Let T.V^V be an invertible operator. Show that T~ l is 
equal to a polynomial in T of degree not exceeding n. 

Let m(t) be the minimal polynomial of T. Then m(t) = t r + a^f-^ + • • • + a t t + a 0 , where 
r — n. Since T is invertible, a 0 ¥= 0. We have 

m(T) = T* + a^^-i + ■■■ + 0l T + a 0 / = 0 

Hence 

_i.( r ,- 1 + ar _ ir r-2 + ... +ai/ )j. = J and T -l = _ J_ (r r-l + clr _ ir r-2 + ... + a J) 



MISCELLANEOUS PROBLEMS 

9.31. Let T be a linear operator on a vector space V of dimension n. Without using the 
Cayley-Hamilton theorem, show that T is a zero of a nonzero polynomial. 

Let N - n 2 . Consider the following N+ 1 operators on V: I, T, T 2 , T N . Recall that the 
vector space A(V) of operators on V has dimension N — n 2 . Thus the above N + l operators are 
linearly dependent. Hence there exist scalars a, 6 ,a lt . ..,a N for which a N T N + ■■■ + a^T + a 0 I = 0. 
Accordingly, T is a zero of the polynomial f(t) = a N t N + • ■ • + a x t + a 0 . 



9.32. Prove Theorem 9.13: Let A be an eigenvalue of an operator T : V -» V. The geometric 
multiplicity of X does not exceed its algebraic multiplicity. 

Suppose the geometric multiplicity of X is r. Then X contains r linearly independent eigenvectors 
v lt ...,v r . Extend the set {v^ to a basis of V: {v lt . . ., v r , w u . . ., w s }. We have 

T( Vl ) = \v t 
T(v 2 ) = \v 2 

T(v r ) = \v r 

TiwJ = a n v 1 + • ■ ■ + a lr v r + & n W! + • • • + b u w s 
T(w 2 ) = a 21 v x + • ■ • + a 2r v r + b^w^ + • • • + b 2s w s 



T(w s ) = a sl v t + 
The matrix of T in the above basis is 



+ a„v r + b slWl + 



M = 



X 0 . . 


. 0 


j °11 a 21 • 


■■ "sl\ 


ox .. 


. 0 


1 <*12 a 22 

J 


■ "s2 \ 


0 0.. 


. X 


a lr «2r • 


■ a sr 


0 0.. 


0 


] hi hi ■ 


■ hi 


0 0 


. 0 


1 hz h 22 ■ 


■ hz 






J 




0 0.. 


0 


1 hs hus ■ 


■ hs/ 



+ b ss w s 



x/ r 



where A = (ay)' and B = (fty)'. 

By Problem 9.20 the characteristic polynomial of X/„ which is (f — X) r , must divide the char- 
acteristic polynomial of M and hence T. Thus the algebraic multiplicity of X for the operator T is 
at least r, as required. 



9.33. Show that A = ^ * * ^ is not diagonalizable. 

The characteristic polynomial of A is A(t) = (t — l) 2 ; hence 1 is the only eigenvalue of A. We 
find a basis of the eigenspace of the eigenvalue 1. Substitute t = 1 into the matrix tl — A to obtain 
the homogeneous system 
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The system has only one independent solution, e.g. x = 1, y = 0. Hence u - (1, 0) forms a basis 
of the eigenspace of 1. 

Since A has at most one independent eigenvector, A cannot be diagonalized. 



9.34. Let F be an extension of a field K. Let A be an %-square matrix over K. Note that 

.a. a. 

A may also be viewed as a matrix A over F. Clearly \tl~ A\ = |*/- A|, that is, A 
and A have the same characteristic polynomial. Show that A and A also have the 
same minimum polynomial. 

Let m(t) and m'(t) be the minimum polynomials of A and A respectively. Now m'(t) divides 
every polynomial over F which has A as a zero. Since m(t) has A as a zero and since m(t) may be 
viewed as a polynomial over F, m'(t) divides m(t). We show now that m(t) divides m'(t). 

Since m'(t) is a polynomial over F which is an extension of K, we may write 
m'(t) = /!(«)&! + f 2 (t)b 2 + ■■■ + f n (t)b n 

where / ; (t) are polynomials over K, and 6 t 6 n belong to F and are linearly independent over K. 

We have , , 

m'(A) = AfA)^ + / 2 (A)6 2 + ■■• + / n (A)6„ - 0 (Jl) 

Let a^P denote the y-entry of f k (A). The above matrix equation implies that, for each pair 

a" 1 *, + «l j 2) 6 2 + • • ■ + 4 n) 6n = 0 

Since the 6 ( are linearly independent over K and since the a*]" £ K, every = 0. Then 

h(A) = 0, / 2 (A) = 0, .... /»(A) = 0 

Since the are polynomials over K which have A as a zero and since m{t) is the minimum poly- 
nomial of A as a matrix over K, m(t) divides each of the /;(()• Accordingly, by (1), m(t) must also 
divide m'(t). But monic polynomials which divide each other are necessarily equal. That is, 
m(t) — m'(t), as required. 



9.35. Let {vi, . . . , v n } be a basis of V. Let T:V-*V be an operator for which T(vi) = 0, 
T(v2) = 021^1, T(vs) — asivi + a 3 2V2, . . ■, T(v„) = a n iVi + • • • + a n ,n-iv n -i. Show that 
T n = 0. 

It suffices to show that 

Ti(vj) = 0 (*) 

for j — 1, . . . , n. For then it follows that 

rn(t)j) = T«-'(Ti(Vj)) = T n ~i(0) =0, for 3 = 1 n 

and, since {v v . . . , v„} is a basis, T n = 0. 

We prove (*) by induction on j. The case j = 1 is true by hypothesis. The inductive step 
follows (for j = 2, . . . , n) from 

Ti( V} ) = Ti-HTiVj)) = T'~Ha n v 1 + ■■■ +a u _ 1 v j ^ l ) 

= a n Ti-H Vl )+ ■■■ +o j , 3 _ 1 ri-i( Vj _ 1 ) 

= a n 0 + ••• + aj, 3 -i0 = 0 

Remark. Observe that the matrix representation of T in the above basis is triangular with 
diagonal elements 0: 

/o o 21 o 31 ... a„i \ 
0 0 a 32 ... a n2 



\ 0 0 0 ... a„,„-i / 
\o 0 0 ... 0 / 
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Supplementary Problems 

POLYNOMIALS OF MATRICES AND LINEAR OPERATORS 

9.36. Let /(() = 2*2 - 5t + 6 and g(t) = P - 2V + t + 3. Find f(A), g(A), /(B) and g(B) where 



* - C 1 



and 5 = 



9.37. Let r:R2^R2 be defined by r(*, j/) = (x + j/, 2s). Let f{t) = t* - 2t + 3. Find f(T)(x,y). 

9.38. Let T 7 be the vector space of polynomials u(x) = ax 2 + bx + c. Let D : V -» V be the differential 
operator. Let /(t) = t 2 + 2« - 5. Find /(Z>)Mx)). 



9.39. Let A 



-G !)• 



Find A2, A3, A". 



'8 12 0" 
9.40. Let i? •-- | 0 8 12 

0 8y 



Find a real matrix A such that B = A 3 . 



9.41. Consider a diagonal matrix M and a triangular matrix N: 

/a, 0 ... 0 



M = 



0 a 2 ... 0 



and 



2V 



c 
d 



\ 0 0 ... a n 

Show that, for any polynomial /(f), f(M) and f(N) are of the form 

//(<Xi) 0 ... 0 \ 
0 f(a 2 ) ... 0 



/(AO = 



and 



/(AO = 



0 o 2 
0 0 



/(a,) x 
0 /(a 2 ) . 



/(«„)/ 



0 



0 



9.42. Consider a block diagonal matrix M and a block triangular matrix N: 







0 . 


• 0 \ 






Mi 


B 


M = 


0 


A 2 . 


. 0 


and 


N = 


0 








0 . 








\o 


0 



c 



y 

z 



/(<»n) | 



where the A i are square matrices. Show that, for any polynomial /((), f(M) and f(N) are of the form 

'/(A t ) 0 ... 0 \ //(Aj) X ... Y \ 

0 /(^2) ••• 0 j ^ , fAn = [ 0 /(A 2 ) ... Z 



/(If) = 



f(N) = 



0 



0 



/(AJ , 



/(A „) / 



9.43. Show that for any square matrix (or operator) A, (P _1 AP) n = P~ 1 A n P where P is invertible. More 
generally, show that /(P"UP) = p- 1 /(A)P for any polynomial f(t). 

9.44. Let f(t) be any polynomial. Show that: (i) /(A*) = (/(A))<; (ii) if A is symmetric, i.e. A* = A, 
then /(A) is symmetric. 

EIGENVALUES AND EIGENVECTORS 

9.45. For each matrix, find all eigenvalues and linearly independent eigenvectors: 

»"-(: :)■ » •-(: :)• »»> <-gd 

Find invertible matrices P 1( P 2 and P 3 such that P-iAP^ P- 1 BP 2 and P~ 1 CP 3 are diagonal. 
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9.46. For each matrix, find all eigenvalues and a basis for each eigenspace: 

Iz 1 l\ / 1 2 2\ 

(i) A = 2 4 2 , (ii) B = 1 2 -1 , (iii) C 
\1 1 3/ \-l 1 4/ 



/i i o^ 

0 10 
\0 0 lj 



When possible, find invertible matrices P lt P 2 and P 3 such that P x l AP lt P 2 1 BP 2 and P 3 l CP z are 
diagonal. 

9.47. Consider A = ^ and B = ^ as matrices over the real field R. Find all eigen- 
values and linearly independent eigenvectors. 

9.48. Consider A and B in the preceding problem as matrices over the complex field C. Find all eigen- 
values and linearly independent eigenvectors. 

9.49. For each of the following operators T : R 2 -» R 2 , find all eigenvalues and a basis for each eigen- 
space: (i) T(x,y) = (3a; + Zy, x + 5y); (ii) T(x,y) = (y,x); (iii) T(x,y) = (y,-x). 

9.50. For each of the following operators T : R 3 -» R 3 , find all eigenvalues and a basis for each 
eigenspace: (i) T(x, y, z) = (x + y + z, 2y + z, 2y + 3z); (ii) T(x, y,z) = (x + y,y + z, —2y — z); 
(iii) T(x, y, z) = (x - y, 2x + Zy + 2z, x + y + 2z). 

9.51. For each of the following matrices over the complex field C, find all eigenvalues and linearly 
independent eigenvectors: 



9.52. Suppose v is an eigenvector of operators S and T. Show that v is also an eigenvector of the operator 
aS + bT where a and 6 are any scalars. 

9.53. Suppose v is an eigenvector of an operator T belonging to the eigenvalue X. Show that for n > 0, 
v is also an eigenvector of T n belonging to X". 

9.54. Suppose X is an eigenvalue of an operator T. Show that /(X) is an eigenvalue of f{T). 

9.55. Show that similar matrices have the same eigenvalues. 

9.56. Show that matrices A and A' have the same eigenvalues. Give an example where A and A 4 have 
different eigenvectors. 

9.57. Let S and T be linear operators such that ST - TS. Let X be an eigenvalue of T and let W be 
its eigenspace. Show that W is invariant under S, i.e. S{W) C W. 

9.58. Let V be a vector space of finite dimension over the complex field C. Let W # {0} be a subspace 
of V invariant under a linear operator f : V -* V. Show that W contains a nonzero eigenvector of T. 

9.59. Let A be an n-square matrix over K. Let v lt . . . , v n G K n be linearly independent eigenvectors of 
A belonging to the eigenvalues Xj, . . . , X„ respectively. Let P be the matrix whose columns are the 
vectors v u ...,v n . Show that P _1 AP is the diagonal matrix whose diagonal elements are the 
eigenvalues X t , . . . , X„. 

CHARACTERISTIC AND MINIMUM POLYNOMIALS 

9.60. For each matrix, find a polynomial for which the matrix is a root: 

'2 3 -2^ 

(i) A = ( * _ ), (ii) B = I - * ), (iii) C = I 0 5 4 
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9.61. Consider the ra-square matrix 



A = 



X 


1 


0 . 


. 0 0 


0 


X 


1 . 


. 0 0 


0 


0 


0 . 


. X 1 


0 


0 


0 . 


. 0 X 



Show that f(t) = (t - X)" is both the characteristic and minimum polynomial of A. 



9.62. Find the characteristic and minimum polynomials of each matrix: 



A = 



2 


5 


0 


0 


0 


0 


2 


0 


0 


0 


0 


0 


4 


2 


0 


0 


0 


3 


5 


0 


0 


0 


0 


0 





B = 



1 


0 


q 


0 


3 


0 


0 


0 


0 


3 


l 


0 


0 


0 


3 




0 


0 


0 


3 



c = 



/X 0 0 0 0' 
0X000 
0 0X00 
0 0 0 X0 
\0 0 0 



X 0 / 

0 x/ 



/l 1 0^ 



'2 0 0^ 



9.63. Let A = 0 2 0 j and B = I 0 2 2 J . Show that A and B have different characteristic 
\0 0 1/ \0 0 lj 

polynomials (and so are not similar), but have the same minimum polynomial. Thus nonsimilar 
matrices may have the same minimum polynomial. 



9.64. The mapping T : V -» V defined by T(v) = kv is called the scalar mapping belonging to k G K. 
Show that T is the scalar mapping belonging to kGK if and only if the minimal polynomial of 
T is m(t) = t-k. 



9.65. 
9.66. 
9.67. 

9.68. 



Let A be an w-square matrix for which A k = 0 for some k > n. Show that A" = 0. 

Show that a matrix A and its transpose A* have the same minimum polynomial. 

Suppose f(t) is an irreducible monic polynomial for which f(T) = 0 where T is a linear operator 
T : V -» V. Show that /(*) is the minimal polynomial of T. 

Consider a block matrix M = !?V Show that tl — M = A . ^ ' S) is the char- 
acteristic matrix of M. \ c D / \ _c " u > 



9.69. Let T be a linear operator on a vector space V of finite dimension. Let W be a. subspace of V 
invariant under T, i.e. T(W) cW. Let T w : W -* W be the restriction of T to W. (i) Show that 
the characteristic polynomial of T w divides the characteristic polynomial of T. (ii) Show that the 
minimum polynomial of T w divides the minimum polynomial of T. 



9.70. Let A = 



A(t) 



I a n a 12 o 13 

a 21 a 22 a 23 

\ a 31 »32 a 33 , 

* 3 _ (a n + a 22 + a s 



Show that the characteristic polynomial of A is 





+ 




«13 


+ 


a 22 


^23 


(Hi °22 


«31 


«33 




a S2 


a 33 



°11 a 12 a 13 
a 21 a 22 ^23 
a-31 ^32 a 33 



9.71. Let A be an m-square matrix. The determinant of the matrix of order n — m obtained by deleting 
the rows and columns passing through m diagonal elements of A is called a principal minor of degree 
n — m. Show that the coefficient of t m in the characteristic polynomial A(t) = \tl — A| is the sum 
of all principal minors of A of degree n — m multiplied by (— l) n ~ m . (Observe that the preceding 
problem is a special case of this result.) 
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9.72. Consider an arbitrary monic polynomial f(t) = t" + a n - 1 t n " 1 + ■■■ + a x t + a 0 . The following 
n-square matrix A is called the companion matrix of f(t): 



A = 



0 


0 . 


. 0 


— a 0 




0 . 


. 0 


-«i 


0 


1 . 


. 0 


-a 2 


o 


0 . 


. 1 


-«n-l 



Show that f(t) is the minimum polynomial of A. 

9.73. Find a matrix A whose minimum polynomial is (i) t? - 5t 2 + 6t + 8, (ii) - 5t 3 - 2t + It + 4. 
DIAGONALIZATION 

9.74. Let A = ( a b ~\ be a matrix over the real field R. Find necessary and sufficient conditions on 

\c dj 

a, b, c and d so that A is diagonalizable, i.e. has two linearly independent eigenvectors. 

9.75. Repeat the preceding problem for the case that A is a matrix over the complex field C. 

9.76. Show that a matrix (operator) is diagonalizable if and only if its minimal polynomial is a product 
of distinct linear factors. 

9.77. Let A and B be w-square matrices over K such that (i) AB = BA and (ii) A and B are both 
diagonalizable. Show that A and B can be simultaneously diagonalized, i.e. there exists a basis of 
K n in which both A and B are represented by diagonal matrices. (See Problem 9.57.) 

9.78. Let E : V -* V be a projection operator, i.e. E 2 = E. Show that E is diagonalizable and, in fact, 
can be represented by the diagonal matrix A = ^ ^ r ^ ^ where r is the rank of E. 



Answers to Supplementary Problems 

/-26 -3\ /-40 39\ /3 6\ 

9.36. /(A)=( 5 _ 27 j, ,(A)= (_ 65 _ 27 ), /(B) =( 0 9 J, </(*) = 



9.39. 



9.37. f(T)(x, y) = (Ax - y, -2x + by). 

9.38. f(D){v(x)) = -Sax* + (4a - 5b)x + (2a + 26 - 5c). 



'2 a 61 

9.40. ffiwf. Let A = | 0 2 c | . Set B = A 3 and then obtain conditions on a, 6 and c. 
i° 0 2 / 

9.44. (ii) Using (i), we have (/(A))* = /(A*) = /(A). 

9.45. (i) Xj = 1, u = (2,-1); X 2 = 4, * = (1,1). 

(ii) X x = 1, u = (2,-3); X 2 = 6, v = (1,1). 

(iii) X = 4, «=(1.1). 

Let Pj = ^ and P 2 = ^ j. P3 does not exist since C has only one independent 

eigenvector, and so cannot be diagonalized. 
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9.46. (i) = 2, u = (1, -1, 0), v = (1, 0, -1); X 2 = 6, w = (1, 2, 1). 

(ii) X t = 3, it = (1, 1, 0), v = (1, 0, 1); X 2 = 1, w = (2, -1, 1). 

(iii) X = l, « = (1,0,0), « = (0,0,1). 

/ 1 1 1\ /I 1 2\ 

Let P x = — 1 0 2 and P 2 = 1 0 — 1 . P s does not exist since C has at most two 

\ 0 -1 lj \0 1 lj 

linearly independent eigenvectors, and so cannot be diagonalized. 

9.47. (i) X = 3, tt = (1,-1); (ii) B has no eigenvalues (in R). 

9.48. (i) X = 3, u = (1, -1). (ii) Xj = 2i, it = (1, 3 - 2i); X 2 = -2i, d = (1, 3 + 2i> 

9.49. (i) Xi = 2, m = (3, -1); X 2 = 6, v = (1, 1). (ii) Xj = 1, it = (1, 1); X 2 = -1, v = (1, -1). (iii) There 
are no eigenvalues (in R). 

9.50. (i) X t = 1, tt = (1, 0, 0); X 2 = 4, « = (1, 1, 2). 

(ii) X = 1, it = (1, 0, 0). There are no other eigenvalues (in R). 

(iii) Xi = 1, « = (1, 0, -1); X 2 = 2, v = (2, -2, -1); X 3 = 3, w = (1, -2, -1). 

9.51. (i) X! = 1, it = (1,0); X 2 = i, v = (1,1 + t). (ii) X = 1, it = (1,0). (iii) Xi = 2, it = (3,i); X 2 = -2, 
v = (1, -i). (iv) Xj = t, it = (2, 1 - 0; X 2 = -i, v = (2, 1 + i). 

9.56. Let A = f 1 J j . Then X = 1 is the only eigenvalue and v = (1, 0) generates the eigenspace 

\" ' /l 0\ 

of X = 1. On the other hand, for At = I ) , X = 1 is still the only eigenvalue, but w = (0, 1) 

generates the eigenspace of X = 1. > ' 

9.57. Let v £ W, and so T(v) = \v. Then T(Sv) = S(Tv) = S(\v) = \(Sv), that is, Sv is an eigenvector 
of T belonging to the eigenvalue X. In other words, Sv e W and thus S(W) C W. 

9.58. Let r : TP -» W be the restriction of T to W. The characteristic polynomial of T is a polynomial 
over the complex field C which, by the fundamental theorem of algebra, has a root X. Then X is an 
eigenvalue of T, and so T has a nonzero eigenvector in W which is also an eigenvector of T. 

9.59. Suppose T(v) = \v. Then (kT)(v) = kT(v) = Jc(\v) = (fcx)v. 

9.60. (i) /(<) = t 2 -8* + 43, (ii) g(t) = t 2 - St + 23, (iii) h(t) = #-W+U- 12. 

9.62. (i) A(t) = (t-2)»(t-7) 2 ; m(t) = (t-2) 2 (t-7). (ii) M*) = (*~3) 5 ; m(t) = (t-3) 3 . (iii) A(t) = 
(t-X) 5 ; m(i) = *-X. 







0 






9.73. Use the result of Problem 9.72. (i) A = 




0 


-6 


, (ii) A = | 






1 


5/ 






9.77. Hint. Use the result of Problem 9.57. 
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Canonical Forms 

INTRODUCTION 

Let T be a linear operator on a vector space of finite dimension. As seen in the preceding 
chapter, T may not have a diagonal matrix representation. However, it is still possible 
to "simplify" the matrix representation of T in a number of ways. This is the main topic 
of this chapter. In particular, we obtain the primary decomposition theorem, and the 
triangular, Jordan and rational canonical forms. 

We comment that the triangular and Jordan canonical forms exist for T if and only if 
the characteristic polynomial A(t) of T has all its roots in the base field K. This is always 
true if K is the complex field C but may not be true if K is the real field R. 

We also introduce the idea of a quotient space. This is a very powerful tool and will be 
used in the proof of the existence of the triangular and rational canonical forms. 



w-dimensional vector space V. Suppose T can be rep- 

/ an ai2 . . . a\n \ 

G22 . . . a2n 



\ Onn I 

Then the characteristic polynomial of T, 

A(t) = \tI~A\ = {t - an)(t - a 22 ) . . . (t - Onn) 

is a product of linear factors. The converse is also true and is an important theorem; 
namely, 

Theorem 10.1: Let T : V -» V be a linear operator whose characteristic polynomial factors 
into linear polynomials. Then there exists a basis of V in which T is 
represented by a triangular matrix. 

Alternate Form of Theorem 10.1: Let A be a square matrix whose characteristic poly- 
nomial factors into linear polynomials. Then A is 
similar to a triangular matrix, i.e. there exists an in- 
vertible matrix P such that P _1 AP is triangular. 

We say that an operator T can be brought into triangular form if it can be represented 
by a triangular matrix. Note that in this case the eigenvalues of T are precisely those 
entries appearing on the main diagonal. We give an application of this remark. 



TRIANGULAR FORM 

Let T be a linear operator on an 
resented by the triangular matrix 

A = 
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Example 10.1 : Let A be a square matrix over the complex field C. Suppose X is an eigenvalue of AK 
Show that a/x or -Vx is an eigenvalue of A. We know by the above theorem that 
A is similar to a triangular matrix 



B = 

Hence A 2 is similar to the matrix 

£2 = 



I H * 



I fit * 



Since similar matrices have the same eigenvalues, X = /i? for some i. Hence 
iti-y/x or m = -\/x; that is, -\/\ or - \/x is an eigenvalue of A. 



INVARIANCE 

Let T:V-*V be linear. A subspace W of V is said to be invariant under T or 
T-invariant if T maps W into itself, i.e. if v GW implies T(v) G W. In this case T 
restricted ioW defines a linear operator on W; that is, T induces a linear operator T:W-*W 
defined by T(w) = T(w) for every w G W. 

Example 10.2: Let T : R 3 -> R3 be the linear operator which rotates each vector about the z axis 
by an angle e: 

T(x, y, z) = (x cos $ — y sin e, x sin 9 + y cos ff, z) 

Observe that each vector w — 
(a, b, 0) in the xy plane W remains 
in W under the mapping T, i.e. W 
is T-invariant. Observe also that 
the z axis U is invariant under T. 
Furthermore, the restriction of T 
to W rotates each vector about the 
origin O, and the restriction of T 
to U is the identity mapping on U. 



Example 10.3: Nonzero eigenvectors of a linear operator T : V -* V may be characterized as gen- 
erators of T-invariant 1-dimensional subspaces. For suppose T(v) = Kv, v # 0. 
Then W = {kv, k € K }, the 1-dimensional subspace generated by v, is invariant 
under T because 

T(kv) = k T(v) = k(Xv) = kxv e W 

Conversely, suppose dim U = 1 and u¥= 0 generates 17, and £/ is invariant under 
T. Then T(u) e 17 and so T(u) is a multiple of u, i.e. T(tt) = jm. Hence u is an 
eigenvector of T. 

The next theorem gives us an important class of invariant subspaces. 

Theorem 10.2: Let T:V ->V be linear, and let f(t) be any polynomial. Then the kernel 
of f(T) is invariant under T. 

The notion of invariance is related to matrix representations as follows. 

Theorem 10.3: Suppose W is an invariant subspace of T.V-+V. Then T has a block 

(A B\ 

matrix representation I Q c J where A is a matrix representation of 
the restriction of T to W. 
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INVARIANT DIRECT-SUM DECOMPOSITIONS 

A vector space V is termed the direct sum of its subspaces Wi, . . . , W r , written 

V — e W 2 e • • ■ © W r 
if every vector v & V can be written uniquely in the form 

V = Wl + Wi + • • • + Wr With Wi G Wi 

The following theorem applies. 

Theorem 10.4: Suppose Wi,...,W, are subspaces of V, and suppose 

{Wll, . . ., Win,}, {Wrl, ■ ■ ■ , Wrn r } 

are bases of W u . . ., W r respectively. Then V is the direct sum of the 
Wi if and only if the union {wn, . . .,Wm lt ■ ■ .,Wn, . . .,Wm r ) is a basis 
of V. 

Now suppose T : V ■* V is linear and V is the direct sum of (nonzero) T-invariant 
subspaces Wi, . . ., W r : 

V = Wi © • • • © W r and T(Wi) cW h i = 1, . . .,r 

Let Ti denote the restriction of T to Wi. Then T is said to be decomposable into the operators 
Ti or T is said to be the direct sum of the T it written T = Ti © • ■ • © 2V. Also, the sub- 
spaces W lt ...,W r are said to reduce Tor to form a T-invariant direct-sum decomposition of V. 

Consider the special case where two subspaces U and W reduce an operator T : V ■* V; 
say, dimf/ = 2 and dim W = 3 and suppose {«i,«2> and {w h W2,Ws} are bases of U and 
TP respectively. If Ti and T 2 denote the restrictions of T to Z7 and W respectively, then 

T 2 (Wi) - buWi + bliWi + bi 3 w 3 

Ti (ui) = auUi + ai2«2 m , . , , , , , 
v ' T 2 (w 2 ) = &21W1 + baiv-i + 623W3 

V ' T 2 (W 3 ) = &3lWl + b 3 2W Z + O33W3 



Hence 



6ll &21 t>31 

A = ( an tt21 ^ and B = [ Z>» 6« &3 2 

\&13 &23 & 



ai 2 0t22 



are matrix representations of Ti and T 2 respectively. By the above theorem {tti, its, »i, w 2 , w 3 } 
is a basis of V. Since T(«i) = Ti(m) and T(w,) = T 8 (w,), the matrix of T in this basis is 

(A 0 



the block diagonal matrix y Q ^ 

A generalization of the above argument gives us the following theorem. 

Theorem 10.5: Suppose T:V-*V is linear and V is the direct sum of T-invariant sub- 
spaces W u Wr. If At is a matrix representation of the restriction of 
T to W h then T can be represented by the block diagonal matrix 







0 . 


. 0 


M = 


0 


A 2 . 


. . 0 










0 





The block diagonal matrix M with diagonal entries A u ...,A r is sometimes called the 
direct sum of the matrices Ai, . . . , A r and denoted by M = Ai © • • • © A t . 
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PRIMARY DECOMPOSITION 

The following theorem shows that any operator T:V->V is decomposable into oper- 
ators whose minimal polynomials are powers of irreducible polynomials. This is the first 
step in obtaining a canonical form for T, 

Primary Decomposition Theorem 10.6: Let T.V^V be a linear operator with minimal 
polynomial 

m(t) = U («) Bl /2 (*)»«... /,(«)* 

where the fi(t) are distinct monic irreducible polynomials. Then V is the 
direct sum of T-invariant subspaces W u . . . , W r where Wi is the kernel of 
fi(T) n <. Moreover, / 4 is the minimal polynomial of the restriction of 
T to Wu 

Since the polynomials are relatively prime, the above fundamental result follows 

(Problem 10.11) from the next two theorems. 

Theorem 10.7: Suppose T.V^V is linear, and suppose f(t) = g(t)h(t) are polynomials 
such that f(T) = 0 and g(t) and h(t) are relatively prime. Then V is the 
direct sum of the T-invariant subspaces U and W , where U = Ker g(T) 
and W = Ker h(T). 

Theorem 10.8: In Theorem 10.7, if f(t) is the minimal polynomial of T [and g(t) and h(t) 
are monic], then g(t) and h(t) are the minimal polynomials of the restric- 
tions of T to U and W respectively. 

We will also use the primary decomposition theorem to prove the following useful 
characterization of diagonalizable operators. 

Theorem 10.9: A linear operator T : V -* V has a diagonal matrix representation if and 
only if its minimal polynomial m(t) is a product of distinct linear 
polynomials. 

Alternate Form of Theorem 10.9: A matrix A is similar to a diagonal matrix if and only 
if its minimal polynomial is a product of distinct linear polynomials. 

Example 10.4: Suppose A ¥= I is a square matrix for which A 3 = J. Determine whether or not 
A is similar to a diagonal matrix if A is a matrix over (i) the real field R, (ii) the 
complex field C. 

Since A 3 — I, A is a zero of the polynomial f{t) = t 3 - 1 = (t - 1)(*2 + t + 1). 
The minimal polynomial m(t) of A cannot be t — 1, since A I. Hence 

m(t) = & + t + 1 or m(t) = t 3 - 1 

Since neither polynomial is a product of linear polynomials over R, A is not diag- 
onalizable over R. On the other hand, each of the polynomials is a product of distinct 
linear polynomials over C. Hence A is diagonalizable over C. 



NILPOTENT OPERATORS 

A linear operator T:V-*V is termed nilpotent if T n = 0 for some positive integer n; 
we call k the index of nilpotency of T if T k — 0 but T fc_1 ¥= 0. Analogously, a square matrix 
A is termed nilpotent if A" = 0 for some positive integer n, and of index k if A k = 0 but 
A k_1 ¥■ 0. Clearly the minimum polynomial of a nilpotent operator (matrix) of index k is 
m(t) — t k ; hence 0 is its only eigenvalue. 

The fundamental result on nilpotent operators follows. 

Theorem 10.10: Let T.V-+ V be a nilpotent operator of index k. Then T has a block 
diagonal matrix representation whose diagonal entries are of the form 
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N = 



0 


1 


0 . 


. 0 


0 


0 


0 


1 . 


. 0 


0 


0 


0 


0 . 


. 0 


1 


0 


0 


0 . 


. 0 


0 



(i.e. all entries of N are 0 except those just above the main diagonal where 
they are 1). There is at least one N of order k and all other N are of orders 
^ k. The number of N of each possible order is uniquely determined by 
T. Moreover, the total number of N of all orders is equal to the nullity 
of T. 

In the proof of the above theorem, we shall show that the number of N of order i is 
2m 4 — m i+ i — wii-i, where ra ( is the nullity of T\ 

We remark that the above matrix N is itself nilpotent and that its index of nilpotency is 
equal to its order (Problem 10.13). Note that the matrix N of order 1 is just the 1 X 1 zero 
matrix (0). 



JORDAN CANONICAL FORM 

An operator T can be put into Jordan canonical form if its characteristic and minimal 
polynomials factor into linear polynomials. This is always true if K is the complex field C. 
In any case, we can always extend the base field K to a field in which the characteristic 
and minimum polynomials do factor into linear factors; thus in a broad sense every operator 
has a Jordan canonical form. Analogously, every matrix is similar to a matrix in Jordan 
canonical form. 

Theorem 10.11: Let T:V-* V be a linear operator whose characteristic and minimum 
polynomials are respectively 

A(t) = {t- Ai)»> . . . (t - A,)"* and m(t) = (t - \i) m » ...(*- M m ' 
where the Ai are distinct scalars. Then T has a block diagonal matrix 
representation J whose diagonal entries are of the form 

/A; 1 0 ... 0 0\ 



0 Ai 1 



0 0 



J a — 



0 
0 



0 
0 



Ai 
0 



For each A f the corresponding blocks /« have the following properties: 

(i) There is at least one J a of order m; all other J v are of order ^ m. 

(ii) The sum of the orders of the J a is m. 

(iii) The number of J a equals the geometric multiplicity of Ai. 

(iv) The number of J a of each possible order is uniquely determined by T. 
The matrix J appearing in the above theorem is called the Jordan canonical form of the 

operator T. A diagonal block J« is called a Jordan block belonging to the eigenvalue A { . 
Observe that 

0 ... 0 0\ /0 1 
Ai ... 0 0 \ / 0 0 



Ai 
0 



1 

Ai 



0 
1 



0 
0 



/ Ai 

/ o 



0 
0 



0 
0 



0 
0 



Ai 
0 



+ 



\ 



0 
0 



Ai 
0 



0 

Ai 



0 
0 



0 
0 



0 
0 



0 
0 



0 
0 



0 
0 
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That is, 



Jh = XJ + N 



where N is the nilpotent block appearing in Theorem 10.10. In fact, we prove the above 
theorem (Problem 10.18) by showing that T can be decomposed into operators, each the sum 
of a scalar and a nilpotent operator. 



Example 10.5: Suppose the characteristic and minimum polynomials of an operator T are respec- 
tively 

A(<) = (t-2)*(t-3)3 and m(t) = (t-2)"(t-3)2 
Then the Jordan canonical form of T is one of the following matrices: 





The first matrix occurs if T has two independent eigenvectors belonging to its eigen- 
value 2; and the second matrix occurs if T has three independent eigenvectors be- 
longing to 2. 



CYCLIC SUBSPACES 

Let T be a linear operator on a vector space V of finite dimension over K. Suppose 
v G V and v ¥> 0. The set of all vectors of the form f(T)(v), where f(t) ranges over all 
polynomials over K, is a T-invariant subspace of V called the T-cyclic subspace of V gen- 
erated by v;we denote it by Z{v, T) and denote the restriction of T to Z(v, T) by T v . We 
could equivalents define Z(v, T) as the intersection of all T-invariant subspaces of V 
containing v. 

Now consider the sequence 

v, T(v), T*(v), T*{v), . . . 

of powers of T acting on v. Let k be the lowest integer such that T k (v) is a linear com- 
bination of those vectors which precede it in the sequence; say, 

T\v) = -a*-! T*-\v) - ... - ai T(v) - a 0 v 
Then m v (t) = t" + a k -it k ~ l + ■■• + ait + a 0 

is the unique monic polynomial of lowest degree for which m v (T) (v) = 0. We call m v (t) the 
T-annihilator of v and Z(v, T). 

The following theorem applies. 
Theorem 10.12: Let Z(v, T), T v and m v (t) be defined as above. Then: 

(i) The set {v, T(v), . . ., T k ~ l (v)} is a basis oiZ{v, T); hence &\mZ{v,T) = k. 

(ii) The minimal polynomial of T v is m v (t). 

(iii) The matrix representation of T„ in the above basis is 
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C = 



0 0 0 
10 0 
0 10 



0 — a 0 
0 -ai 
0 — a 2 



0 0 0 
0 0 0 



0 -a k - 2 

1 — aic-t 



The above matrix C is called the companion matrix of the polynomial m v (t). 

RATIONAL CANONICAL FORM 

In this section we present the rational canonical form for a linear operator T : V -> V. 
We emphasize that this form exists even when the minimal polynomial cannot be factored 
into linear polynomials. (Recall that this is not the case for the Jordan canonical form.) 

Lemma 10.13: Let T : V -* V be a linear operator whose minimal polynomial is /(*)" where 
f(t) is a monic irreducible polynomial. Then V is the direct sum 

V = Z(vi, T) © • • • © Z(v r , T) 
of T-cyclie subspaces Z(v h T) with corresponding T-annihilators 

/(*)% f(t) n *, f(t) n -, n = ni — n% — • • ■ — n T 
Any other decomposition of V into T-cyclic subspaces has the same number 
of components and the same set of T-annihilators. 
We emphasize that the above lemma does not say that the vectors Vi or the T-cyclic sub- 
spaces Z(vi, T) are uniquely determined by T; but it does say that the set of T-annihilators 
are uniquely determined by T. Thus T has a unique matrix representation 



where the d are companion matrices. In fact, the & are the companion matrices to the 
polynomials /(*)"'. 

Using the primary decomposition theorem and the above lemma, we obtain the following 
fundamental result. 

Theorem 10.14: Let T:V^V be a linear operator with minimal polynomial 




C, 



m (t) = /i(t)- /.(*)»• ... f s (t) m - 
where the fi{t) are distinct monic irreducible polynomials. Then T has a 
unique block diagonal matrix representation 




where the C« are companion matrices. In particular, the C» are the com- 
panion matrices of the polynomials /i(£) nii where 

mi = nu s= n i2 ^ • ■ • - nir t , . . . , w^fa-fa-'" - n S r s 
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The above matrix representation of T is called its rational canonical form. The poly- 
nomials fi(t) n u are called the elementary divisors of T. 

Example 10.6: Let V be a vector space of dimension 6 over B, and let T be a linear operator whose 
minimal polynomial is m(t) = (t 2 - t + 3)(* - 2) 2 . Then the rational canonical form 
of T is one of the following direct sums of companion matrices: 

(i) C(P-t + S) © C(t*-t + 3) © C((i-2)2) 

(ii) C(t 2 -« + 3) © C((t-2)2) © C((t-2) 2 ) 

(iii) C(t*-t + 3) © C((i-2)2) © C(t-2) © C(t-2) 
where C(/(t)) is the companion matrix of /(«); that is, 




-3 I 



.L 



\ 



12, 



(i) 



(ii) 



(iii) 



QUOTIENT SPACES 

Let V be a vector space over a field K and let W be a subspace of V. If v is any vector 
in V, we write v + W for the set of sums v + w with w £ W: 

v + W = »€W} 

These sets are called the cosets of W in V. We show (Problem 10.22) that these cosets 
partition V into mutually disjoint subsets. 

Example 10.7: Let W be the subspace of R 2 defined 
by 

W = {(a, 6) : a = 6} 

That is, W is the line given by the 
equation x — y = 0. We can view 
s+Wssa translation of the line, 
obtained by adding the vector v to 
each point in W. As noted in the 
diagram on the right, v + W is also 
a line and is parallel to W. Thus 
the cosets of W in R 2 are precisely 
all the lines parallel to W. 

In the next theorem we use the cosets of a subspace W of a vector space V to define a 
new vector space; it is called the quotient space of V by W and is denoted by V/W. 

Theorem 10.15: Let W be a subspace of a vector space over a field K. Then the cosets of 
W in V form a vector space over K with the following operations of addi- 
tion and scalar multiplication: 

(i) {u + W) + (v + W) = (u + v) + W 

(ii) k(u + W) = ku + W, where k<=K. 

We note that, in the proof of the above theorem, it is first necessary to show that the 
operations are well defined; that is, whenever u + W = u' + W and v + W = v' + W, then 

(i) (u + v) + W = (u' + V) + W and (ii) k u + W = ku' + W, for any k G K 
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In the case of an invariant subspace, we have the following useful result. 

Theorem 10.16: Suppose W is a subspace invariantunder a linear operator^ T : V -* V. 

Then T induces a linear operator T on V/W denned by T(v + W) = 
T(v) + W. Moreover, if T is a zero of any polynomial, then so is T. 
Thus the minimum polynomial of T divides the minimum polynomial of T. 



Solved Problems 

INVARIANT SUBSPACES 

10.1. Suppose T : V -» V is linear. Show that each of the following is invariant under T: 
(i) {0}, (ii) V, (iii) kernel of T, (iv) image of T. 

(i) We have T(0) = 0 £ {0}; hence {0} is invariant under T. 

(ii) For every «£V, T(v) € V; hence V is invariant under T. 

(iii) Let u e Ker T. Then T(u) = 0 G Ker T since the kernel of T is a subspace of V. Thus 
Ker T is invariant under T. 

(iv) Since T(v) e Im T for every v GV, it is certainly true if v £ Im T. Hence the image of 
T is invariant under T. 



10.2. Suppose {Wi} is a collection of T-invariant subspaces of a vector space V. Show that 
the intersection W = HiWi is also T-invariant. 

Suppose v e W; then v S W t for every i. Since W t is T-invariant, T(v) e W f for every i. 
Thus T(v) SW= n j Wi and so W is T-invariant. 



10.3. Prove Theorem 10.2: Let T : V -* V be any linear operator and let f(t) be any poly- 
nomial. Then the kernel of f(T) is invariant under T. 

Suppose v G Ker/(r), i.e. f(T)(v) = 0. We need to show that T(v) also belongs to the kernel 
of f(T), i.e. f(T)(T(v)) = 0. Since f(t) t = tf(t), we have f(T) T = T f(T). Thus 

f(T)T(v) = Tf(T)(v) = T(0) = 0 

as required. 



10.4. Find all invariant subspaces of A — 



A(t) = \tI-A\ = 



'2 -5' 
1 ~ 2 , 

First of all, we have that R 2 and {0} are invariant under A. Now if A has any other invariant 
subspaces, then it must be 1-dimensional. However, the characteristic polynomial of A is 

t-2 5 
-1 t + 2 



viewed as an operator on R 2 . 



= t 2 + l 



Hence A has no eigenvalues (in R) and so A has no eigenvectors. But the 1-dimensional invariant 
subspaces correspond to the eigenvectors; thus R 2 and {0} are the only subspaces invariant under A. 



10.5. Prove Theorem 10.3: Suppose W is an invariant subspace of T:V-*V. Then T 

has a block diagonal matrix representation ( ) where A is a matrix representa- 

tion of the restriction f of T to W. ^ ' 

We choose a basis {w u ...,w T } of W and extend it to a basis {w t , ...,w„Vi v s } of V. 

We have 
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T(wj) - T(w t ) = a n w 1 + • ■ ■ + a lr w r 
a. 

T(w 2 ) = T(w 2 ) = o 21 w 1 +---+o 2r w r 

r(w r ) - r(t« r ) = o rt «i, + • • • + a„w r 

T(v{) = b nWl + • • • + b lr w r + cn-uj + • • • + c ls v s 
T(v 2 ) = 62^! + • • • + b 2r w r + c 21 v y + • • • + c 2s v s 

T(v t ) = 6 sl w 1 + • • • + b sr w T + Cjii/j + • • • + c ss v s 
But the matrix of T in this basis is the transpose of the matrix of coefficients in the above system 
of equations. (See page 150.) Therefore it has the form [ ^ ^) where A is the transpose of 



0 C, 

the matrix of coefficients for the obvious subsystem. By the same argument, A is the matrix of 
T relative to the basis of W. 

10.6. Let T denote the restriction of an operator T to an invariant subspace W, i.e. 
T(w) = T(w) for every w G W. Prove: 

(i) For any polynomial f(t), f(f)(w) = f(T)(w). 

(ii) The minimum polynomial of T divides the minimum polynomial of T. 

(i) If f(t) = 0 or if /(t) is a constant, i.e. of degree 1, then the result clearly holds. Assume 
deg/ = n > 1 and that the result holds for polynomials of degree less than n. Suppose that 

f(t) = a„t» + a„_it«-i + ■■■ + ai t + a 0 

Then f{T)(w) = (a n T» + a^T"-* + • • • + «„/)(«>) 

= (a B r»-i)(r(w)) + K-! ?»-!+•••+ oo/)(w) 
= (o„r»- !)(?(«;)) + (o n _!r»-i + • • • + oo^(w) 

(ii) Let w(t) denote the minimum polynomial of T. Then by (i), m(T)(w) = m(T)(w) = 0(w) = 0 
for every w E W; that is, T is a zero of the polynomial m(t). Hence the minimum polynomial 
of T divides m(t). 

INVARIANT DIRECT-SUM DECOMPOSITIONS 

10.7. Prove Theorem 10.4: Suppose Wi, . . . , W r are subspaces of V and suppose, for 
i = 1, . . . , r, {wn, .... WinJ is a basis of Then V is the direct sum of the Wi if 
and only if the union 

, . „ TT 5 = {Wn, . . . , Wm v Wrl, . . ., Wm r ) 

is a basis of V. 

Suppose B is a basis of V. Then, for any v G V, 

v = OnWn + • • • + a lni w lni + • • • + a rl w rl + • • • + a^w^ = w 1 + w 2 + ■•■ + w r 

where w i = a a w n + • • • + a-m^in. ^ W t . We next show that such a sum is unique. Suppose 

v — w [ + w 2 + • • • + w' r where w[ e Wi 

Since {w n , . . . , M> in .} is a basis of TFj, w- = b tl w a + • • • + b in ,w in . and so 

v = b n w n + • ■ • + 6 lni w lni + • • • + 6 rl w rl + ■ • ■ + b rn w mr 

Since B is a basis of V, a tj = b ijt for each i and each j. Hence w i = w[ and so the sum for v 
is unique. Accordingly, V is the direct sum of the W t . 

Conversely, suppose V is the direct sum of the W t . Then for any v S V, v = w x + • ■ • + w r 
where w { €= Since is a basis of each w t is a linear combination of the Wy 4 and so v 

is a linear combination of the elements of B. Thus B spans V. We now show that B is linearly 
independent. Suppose 

a n w n + • • • + a ln w lni + • • • + a rl w rl + 
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Note that a n w a + • • • + a in .w in . G W { . We also have that 0 = 0 + 0+ -- -+ 0 where 0 G W t . Since 
such a sum for 0 is unique, 

a a w a + ■■■ + a in .w ln . = 0 for i = 1, . . . , r 

The independence of the bases {w fj .} imply that all the o's are 0. Thus B is linearly independent 
and hence is a basis of V. 



10.8. Suppose T : V -» V is linear and suppose T = 2\ © T 2 with respect to a T-invariant 
direct-sum decomposition V = 17 © Show that: 

(i) w(i) is the least common multiple of mi(t) and m 2 (£) where m(i), mi{t) and m 2 (i) 
are the minimum polynomials of T, Ti and T 2 respectively; 

(ii) A(i) = Ai(«) A 2 (£), where A(£), Ai(t) and A 2 (t) are the characteristic polynomials of 
T, Ti and T 2 respectively. 

(i) By Problem 10.6, each of m,(t) and m^t) divides m(t). Now suppose /(t) is a multiple of both 
m^t) and m 2 (t); then fiTJiU) = 0 and f(T 2 )(W) = 0. Let t£V; then i> = m + w with 
m e 17 and w G W. Now 

f(D v = f(T) u + f{T) w = f(Ti) u + f{T 2 ) w = 0 + 0 = 0 

That is, T is a zero of f(t). Hence divides /(t), and so m(t) is the least common multiple of 
m^t) and m 2 (t). 

(ii) By Theorem 10.5, T has a matrix representation M = where A and B are matrix 
representations of Tj and T 2 respectively. Then, by Problem 9.66, 

ti - A 0 
M t) = \tI-M\ = Q tJ B 

as required. 



= \tI-A\\tI-B\ = Ai(t)A 2 (t) 



10.9. Prove Theorem 10.7: Suppose T : V -* V is linear, and suppose /(£) = are 
polynomials such that f{T) = 0 and g(t) and h(t) are relatively prime. Then V 
is the direct sum of the T-invariant subspaces U and W where J7 = Kerfir(T) and 
W = Ker h(T). 

Note first that U and are T-invariant by Theorem 10.2. Now since g(t) and h(t) are relatively 
prime, there exist polynomials r(t) and s(t) such that 

r(t) «r(t) + s(t) M0 = 1 

Hence for the operator T, r(T) g(T) + s(T) h(T) = I (*) 

Let v G V; then by (*), v = r(7) g(T) v + s(T) h(T) v 

But the first term in this sum belongs to W-Kerh(T) since 

h(T) r(T) g(T) v = r(T) g(T) h(T) v = r(T)f(T)v = r(T)0v = 0 

Similarly, the second term belongs to U. Hence V is the sum of U and W. 

To prove that V = U © W, we must show that a sum v = u + w with u G [7, w G W, is 
uniquely determined by v. Applying the operator r(T)g(T) to v = u + w and using flr(r)w = 0, 
we obtain 

r(T)g(T)v = r(T) g(T) u + r(T) g(T) w = r(T)g(T)w 

Also, applying (*) to w alone and using h(T) w = 0, we obtain 

w = r(T) g(T) w + s(T) h(T) w = r(T) g(T) w 

Both of the above formulas give us w = r(T) g(T) v and so w is uniquely determined by v. Sim- 
ilarly w is uniquely determined by v. Hence V = U © W, as required. 
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10.10. Prove Theorem 10.8: In Theorem 10.7 (Problem 10.9), if f(t) is the minimal poly- 
nomial of T (and g(t) and h(t) are monic), then g{t) is the minimal polynomial of the 
restriction Ti of T to U and k(t) is the minimal polynomial of the restriction T 2 of 
Tto W. 

Let m^t) and m 2 (t) be the minimal polynomials of T t and T 2 respectively. Note that g(T^) = 0 
and h(T 2 ) = 0 because U = Ker g(T) and W = Ker h(T). Thus 

mi(t) divides and m 2 (t) divides h(t) (J) 

By Problem 10.9, f(t) is the least common multiple of m^t) and m 2 (t). But m t (t) and m 2 (t) are 
relatively prime since g{t) and /i(t) are relatively prime. Accordingly, f(t) = mj(t) m 2 (t). We also 
have that f(t) — g(t) h(t). These two equations together with (1) and the fact that all the polynomials 
are monic, imply that g(t) — m^t) and h(t) = rn^t), as required. 



10.11. Prove the Primary Decomposition Theorem 10.6: Let T : V ■* V be a linear operator 
with minimal polynomial 

Mt) = fi(t) ni f2(t) n *...f r {t) n * 

where the fi(t) are distinct monic irreducible polynomials. Then V is the direct sum 
of T-invariant subspaces Wi, ...,W r where W, is the kernel of /i(T)"'. Moreover, 
/{(£)"< is the minimal polynomial of the restriction of T to Wu 

The proof is by induction on r. The case r = 1 is trivial. Suppose that the theorem has been 
proved for r — 1. By Theorem 10.7 we can write V as the direct sum of T-invariant subspaces W t 
and Vj where W t is the kernel of f l (T) n i and where Vj is the kernel of f 2 (T) n % . . . f r (T) n r. By 
Theorem 10.8, the minimal polynomial of the restrictions of T to Wi and V, are respectively /^t)" 1 
and / 2 (t)«» ... / r («)«-. 

Denote the restriction of T to V t by T t . By the inductive hypothesis, is the direct sum of 
subspaces W 2 W r such that W { is the kernel of fiiT^i and such that / t (t)"< is the minimal poly- 
nomial for the restriction of T 1 to W ( . But the kernel of f { (T) n i, for i = 2, . . .,r is necessarily 
contained in V l since /^t)"' divides / 2 (t)"s . . . / r (t)" r - Thu s the kernel of / j (7 , ) n i is the same as the 
kernel of /^r^";, which is W t . Also, the restriction of T to W i is the same as the restriction of T 1 
to W{ (for i = 2, . . .,r); hence is also the minimal polynomial for the restriction of T to TVj. 

Thus V = W t ® W 2 © • ■ • © W r is the desired decomposition of T. 



10.12. Prove Theorem 10.9: A linear operator T : V -*■ V has a diagonal matrix representa- 
tion if and only if its minimal polynomial m(t) is a product of distinct linear 
polynomials. 

Suppose m(t) is a product of distinct linear polynomials; say, 

m(t) = (t-\,)(t-X 2 ) ... (t-\ r ) 

where the X { are distinct scalars. By the primary decomposition theorem, V is the direct sum of 
subspaces W v ...,W r where W { = Ker(2'-X j /). Thus if « £ W u then (T-\J)(v) = 0 or 
T(v) — Xj-y. In other words, every vector in W x is an eigenvector belonging to the eigenvalue X ; . By 
Theorem 10.4, the union of bases for W u . . . , W r is a basis of V. This basis consists of eigenvectors 
and so T is diagonalizable. 

Conversely, suppose T is diagonalizable, i.e. V has a basis consisting of eigenvectors of T. Let 
Xj X s be the distinct eigenvalues of T. Then the operator 

f(T) = (r-x,/)(r-x 2 /) ... (r-x s /) 

maps each basis vector into 0. Thus f(T) = 0 and hence the minimum polynomial m(t) of T divides 
the polynomial 

f(t) = (t-X^-X-j) ... (t-Xj) 



Accordingly, m(t) is a product of distinct linear polynomials. 
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NILPOTENT OPERATORS, JORDAN CANONICAL FORM 

10.13. Let T:V -> V be linear. Suppose, for v G V, T k (v) = 0 but r fc -»(t;)#0. Prove: 

(i) The set S = {v, T(v), r fc-1 (v)} is linearly independent. 

(ii) The subspace W generated by S is T-invariant. 

(iii) The restriction f of T to W is nilpotent of index k. 

(iv) Relative to the basis {T k ~ l (v), . . .,T{v),v} of W, the matrix of T is of the form 



/0 1 0 . 
/ 0 0 1 . 



0 0 0 
\0 0 0 



0 0\ 
0 0 ^ 



0 '/ 
0 0/ 



(i) 



(*) 



Hence the above fc-square matrix is nilpotent of index k. 

Suppose 

av + a^iv) + a 2 T2(v) + ••• + « M n-'(«) = 0 

Applying T*-* to (*) and using ^(d) = 0, we obtain aT k -^(v) = 0; since T k ~ 1 (v) ¥° 0, a = 0. 
Now applying to (*) and using T k (v) = 0 and a = 0, we find £4 T 1 *- 1 ^) = 0; hence 

a t = 0. Next applying T k ~ s to (*) and using T k {v) = 0 and a = a t = 0, we obtain 
o 2 r fc_1 (t>) = 0; hence a 2 = 0. Continuing this process, we find that all the a's are 0; hence 
S is independent. 



(ii) Let v e W. Then 

v = bv + biTiv) + b 2 THv) + ••• + b k . 1 T k ~ i (v) 
Using T k (v) = 0, we have that 

T(v) = bT(v) + bi^v) + ■■■ + b k _ 2 Tk-i(v) e W 
Thus W is T-invariant. 



(iii) By hypothesis T k (v) = 0. Hence, for i = 0, . . .,fc— 1, 

rk(r*(v)) = r fc+i (i;) = o 

That is, applying r fc to each generator of W, we obtain 0; hence T k = 0 and so !T is nilpotent 
of index at most fc. On the other hand, T k ~ l (v) — T k ~ l (v) # 0; hence T is nilpotent of index 
exactly fc. 

(iv) For the basis {T k ~^v), 7*~2(v), . . ., T(v),v) of W, 

T(T k ~Hv)) = T k (v) = 0 
T(T k -2(v)) = T k ~ l (v) 
T(T k ~3(y)) = T k ~*{v) 



T(T(v)) 
T(v) 

Hence the matrix of T in this basis is 



T 2 (v) 



T(v) 



0 


1 


0 . 


. 0 


0 


0 


0 


1 . 


. 0 


0 


0 


0 


0 . 


. 0 


1 


0 


0 


0 . 


. 0 


0 
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10.14. Let T-.V-+V be linear. Let U = KerT i and W = KerT i+1 . Show that (i) UcW, 
(ii) T(W) c U. 

(i) Suppose uGU = Kern Then P(u) = 0 and so r*+i(tt) = T(P(u)) = T(0) = 0. Thus 
ueKerP+i = TP. But this is true for every uGV; hence 1/ C W. 

(ii) Similarly, if w£H r =KerI H +', then r*+Mw) = 0. Thus 2' i+1 (w) = ^'(^(w)) = r*(0) = 0 
and so T(W) c t/. 



10.15. Let T : V -» V be linear. Let X = Ker 2"- 2 , Y = Ker T i_1 and Z = Ker T\ By the 
preceding problem, XcYcZ. Suppose 

{U U . . . , Ur}, {Ml, . . . , Mr, Vi, . . ., V s ], {Ml, . . . , M r , Vl, . . . , V„ W\, . . . , Wt} 

are bases of X, Y and ^ respectively. Show that 

S = {«,, . . ., M r , r(Wi), . . ., T(«7»)} 

is contained in Y and is linearly independent. 

By the preceding problem, T(Z) c Y and hence ScY. Now suppose S is linearly dependent. 
Then there exists a relation 

a^i + ■■■ + a r u r + b t r(w t ) + • • • + b t T(w t ) = 0 

where at least one coefficient is not zero. Furthermore, since {mJ is independent, at least one of the 
b k must be nonzero. Transposing, we find 

6 l T(wi) + •■• + b t T(w t ) = -a,M! - ••• - a r u r e X = Ker IT 4-2 
Hence r'-2(6,r(w,) + • • • + b t T(w t )) = 0 

Thus T l ~ l (b l w 1 + ••• + b t w t ) = 0 and so b 1 w 1 + ■ • ■ + b t w t e F = KerT«-i 

Since {ttj, Dj} generates F, we obtain a relation among the u it v s and w fc where one of the coefficients, 
i.e. one of the b k , is not zero. This contradicts the fact that {w i( v jt w k } is independent. Hence S 
must also be independent. 



10.16. Prove Theorem 10.10: Let T:V-*V be a nilpotent operator of index k. Then T 
has a block diagonal matrix representation whose diagonal entries are of the form 



1 
0 



0 
1 



N = 



0 
0 



0 
0 



0 
0 



0 
0 



II 



There is at least one N of order k and all other N are of orders ^ k. The number of 
N of each possible order is uniquely determined by T. Moreover, the total number of 
N of all orders is the nullity of T. 

Suppose dim V = n. Let = Ker T, W 2 = Ker T* W k - Ker Set m { = dim W it 

for i — ...,k. Since T is of index k, W k = V and W k -i V and so m k _ 1 <m k -n. By 
Problem 10.17, 

W t c W 2 c • • • c W k = F 

Thus, by induction, we can choose a basis {w lf . . .,«,} of V such that {«!, . . .,tt mi > is a basis of W t . 

We now choose a new basis for V with respect to which T has the desired form. It will be con- 
venient to label the members of this new basis by pairs of indices. We begin by setting 

v(l, k) = M mfc _ 1 + i, v{2, k) - u mkl + 2 , v{m k -m k _ lt k) = w mfc 
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and setting 

v(l, k - 1) = Tv(l, fc), v{2, k - 1) = Tv(2, fc), . .., v^-m^, k- 1) = Tv{m k -m^ 1 , k) 
By the preceding problem, 

S t = {% . ... M mfc2 , v(l, fe-1), v(m k ~m k ^ 1 ,k-l)} 

is a linearly independent subset of W k - V We extend S x to a basis of W k -x by adjoining new ele- 
ments (if necessary) which we denote by 

v(m k -m k _ 1 + l, k-1), v(m k -m k - 1 + 2, k-l), v(m k _ 1 - m fc _ 2 , k - 1) 

Next we set 

v(l, k - 2) = Tv(l, k - 1), v(2, k-2) = Tv(2, k-1), 

v(m k _ 1 - m k _ 2 , k - 2) = Tv(m k ^ 1 - m fc _ 2 , fc - 1) 
Again by the preceding problem, 

S 2 = . . ., u mk s , v(l, fc-2), . . ., v(m k ^ 1 -m k ^ 2 , k-2)} 

is a linearly independent subset of W k - 2 which we can extend to a basis of W fc _ 2 by adjoining 
elements 

v(m k _ 1 -m k _ 2 + 1, fc-2), ■y(m fc _ I -m fc _ 2 + 2, /c-2), .... i;(m fc _ 2 - m fc _ 3 , A; - 2) 

Continuing in this manner we get a new basis for V which for convenient reference we arrange 
as follows: 

v(l, k), v(m k -m k _ lt k) 

fc-1), v(m k -m k _i,k-\), v(m k _ 1 -m k - 2 ,k-l) 

v(l, 2), v(m k -m k _ 1 ,2), v(m k - x - m k _ 2 ,2), v^-m^) 

v(l,l), v(m k -m k - x , 1), v(m k ^ 1 -m k ^ 2 , 1), v(m 2 -m lt l), v(m lt 1) 

The bottom row forms a basis of W u the bottom two rows form a basis of W 2 , etc. But what is 
important for us is that T maps each vector into the vector immediately below it in the table or into 
0 if the vector is in the bottom row. That is, 

\v(i, j — 1) for j > 1 
0 for j = 1 



Tv{i, j) = 



Now it is clear (see Problem 10.13(iv)) that T will have the desired form if the v(i,j) are ordered 
lexicographically: beginning with v(l, 1) and moving up the first column to v(l, k), then jumping to 
v(2, 1) and moving up the second column as far as possible, etc. 
Moreover, there will be exactly 

m k — Mife-! diagonal entries of order k 

(m k - 1 — m fc _ 2 ) — {m k — m k -i) — 2m fc _ I — m k — m k _ 2 diagonal entries of order fc — 1 



2ra2 — m 1 — m 3 diagonal entries of order 2 

2mtj — m 2 diagonal entries of order 1 



as can be read off directly from the table. In particular, since the numbers m x , . . . , m k are uniquely 
determined by T, the number of diagonal entries of each order is uniquely determined by T. Finally, 
the identity 

mi = (m k -m fc _i) + (2m lt _ 1 -mt fc -m fc _ 2 ) + ••• + (2m 2 -m 1 -m 3 ) + (2m 1 -m 2 ) 
shows that the nullity m,^ of T is the total number of diagonal entries of T. 



'0 1 1 0 l\ 
0 0 1 1 1 \ 



10.17. Let A = ( 0 0 0 0 0 
0 0 0 0 0 

^0 0 0 0 o/ 

hence A is nilpotent of index 2. 
which is similar to A. 



Then A 2 = 



0 


0 


1 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 



and A 3 = 0; 



Find the nilpotent matrix M in canonical form 
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Since A is nilpotent of index 2, M contains a diagonal block of order 2 and none greater than 
2. Note that rank A = 2; hence nullity of A = 5 - 2 = 3. Thus M contains 3 diagonal blocks. 
Accordingly M must contain 2 diagonal blocks of order 2 and 1 of order 1; that is, 



10.18. Prove Theorem 10.11, page 226, on the Jordan canonical form for an operator T. 

By the primary decomposition theorem, T is decomposable into operators T it . . . , T T , i.e. 
T = T 1 © • • • © T T , where (t — X f ) m t is the minimal polynomial of T t . Thus in particular, 

(2 1 , - X^m. = 0, (7V-M) m ' = 0 

Set N { = T t - Xj7. Then for i = 1 r, 

T< = + Xji, where Ar»>< = 0 

That is, T i is the sum of the scalar operator \I and a nilpotent operator N it which is of index m { 
since (t — Xj) m i is the minimal polynomial of T { . 

Now by Theorem 10.10 on nilpotent operators, we can choose a basis so that AT { is in canonical 
form. In this basis, T { = N i + X ( / is represented by a block diagonal matrix M { whose diagonal 
entries are the matrices J {j . The direct sum J of the matrices Af 4 is in Jordan canonical form and, 
by Theorem 10.5, is a matrix representation of T. 

Lastly we must show that the blocks J iS satisfy the required properties. Property (i) follows 
from the fact that N { is of index m { . Property (ii) is true since T and J have the same character- 
istic polynomial. Property (iii) is true since the nullity of N t = T { — X { 7 is equal to the geometric 
multiplicity of the eigenvalue X f . Property (iv) follows from the fact that the T i and hence the N t 
are uniquely determined by T. 



10.19. Determine all possible Jordan canonical forms for a linear operator T : V -» V whose 
characteristic polynomial is A(f) = (t — 2) 3 (t — 5) z . 

Since t — 2 has exponent 3 in A(t), 2 must appear three times on the main diagonal. Similarly 
5 must appear twice. Thus the possible Jordan canonical forms are 



M = 





1 

2 1 



2 



5 1 
5 




(i) 



(«) 



(iii) 




(iv) 



(v) 



(vi) 
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10.20. Determine all possible Jordan canonical forms / for a matrix of order 5 whose 
minimal polynomial is m(t) — (t — 2) 2 . 

J must have one Jordan block of order 2 and the others must be of order 2 or 1. Thus there 
are only two possibilities: 



J = 



2 1 I 
2 1 



I 2 1 | 
| 2 I 

I ;_|. 



2 1 ! 



J = 



I 2 / 

Note that all the diagonal entries must be 2 since 2 is the only eigenvalue. 



I 2 / 



QUOTIENT SPACE AND TRIANGULAR FORM 

10.21. Let W be a subspace of a vector space V. Show that the following are equivalent: 
(i)uGv + W, (ii) u-vGW, (iii) vGu + W. 

Suppose u G v + W. Then there exists w 0 £ W such that u = v + w 0 . Hence u — v = w 0 G W. 
Conversely, suppose u — v G W. Then u — v = w a where w 0 G W. Hence u = v + w 0 £ v + W. 
Thus (i) and (ii) are equivalent. 

We also have: u — v& W iff — (u — v) — v — u e. W iff »e«+W. Thus (ii) and (iii) are 
also equivalent. 



10.22. Prove: The cosets of W in V partition V into mutually disjoint sets. That is: 

(i) any two cosets u + W and v + W are either identical or disjoint; and 

(ii) each v G V belongs to a coset; in fact, v Gv + W. 

Furthermore, u + W = v + W if and only if u — vGW, and so (v + w) + W = v + W 
for any w G W. 

Let v G V. Since 0 G W, we have v = v + 0£v + W which proves (ii). 

Now suppose the cosets u+W and v + W are not disjoint; say, the vector x belongs to both 
u+W and v+W. Then u — x € W and x — v G W. The proof of (i) is complete if we show that 
u + W = v+W. Let u + w 0 be any element in the coset u + W. Since u — x, x — v and w 0 belong 
to W, 

(u + w 0 ) — v = (it — ») + (* — v) + w 0 G W 

Thus u + w 0 G v + W and hence the coset u+W is contained in the coset v + W. Similarly v + W 
is contained in u + W and so u+W = v + W. 

The last statement follows from the fact that u+W = v + W if and only if u&v + W, and 
by the preceding problem this is equivalent to u — v G W. 



10.23. Let W be the solution space of the homo- 
geneous equation 2x + Zy + 4z = 0. De- 
scribe the cosets of W in R 3 . 

W is a plane through the origin O — (0, 0, 0), 
and the cosets of W are the planes parallel to W. 
Equivalently, the cosets of W are the solution sets 
of the family of equations 

2x + Sy + 4z — k, k G R 

In particular the coset v + W, where v = (a, b, c), 
is the solution set of the linear equation 

2x + Zy + 4z = 2a + 36 + 4c 



v + W^ 



TT 7 



2(x -a) + 3(y - 6) + 4(z - c) - 0 
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10.24. Suppose W is a subspace of a vector space V. Show that the operations in Theorem 
10.15, page 229, are well denned; namely, show that if u + W = u' + W and v + W = 
v' + W, then 

(i) (u + v) + W = {u' + v') + W and (ii) ku + W = ku' + W, for any k G K 

(i) Since it + W — u' + W and v + W = v' + W, both it — u' and i> — v' belong to W. But then 
(« + «)- (m' + d') = (it - it') + (v - v') G W. Hence (u + v) + W = (it' + «') + W. 

(ii) Also, since u — u' S W implies k(u — it') G W, then fcit — felt' = k(u — it') G W; hence 
ku+W = ku' + W. 

10.25. Let V be a vector space and W a subspace of V. Show that the natural map -q : V -» WW, 
denned by 17(17) = v + W, is linear. 

For any it, v G V and any fe G K, we have 

i;(it + v) = u + v + W - u + W + v + W = 7j(it) + v(v) 

and i(kv) = kv + W = k(v + W) = k v (v) 

Accordingly, 7; is linear. 



10.26. Let W be a subspace of a vector space V. Suppose {wi, . . . , w T } is a basis of W and 
the set of cosets {vi, . . ., v s ), where Vj = Vj + W, is a basis of the quotient space. 
Show that B = {vi, Wi, ... , w r } is a basis of V. Thus dim V = dim W + 

dim (WW). 

Suppose it G V. Since {-0^} is a basis of V/W, 

ii = it + W — afti + 02*2 + • • • + a s * s 
Hence it - a 1 v 1 + • ■ • + a. s v s + w where w GW. Since {wj is a basis of W, 

it = a 1 v 1 + ••• + a s v s + + • • ■ + b r w r 
Accordingly, B generates V. 

We now show that B is linearly independent. Suppose 

c i v i + • " ' + c s v s + diWj + ■ • ■ + d T w T = 0 (i) 

Then CxV! + ■ • • + c s D s = 0 = W 

Since {-0 3 } is independent, the c's are all 0. Substituting into (J), we find djW! + • ■ • + d r w r = 0. 
Since {wj is independent, the d's are all 0. Thus B is linearly independent and therefore a basis 
of V. 



10.27. Prove Theorem 10.16: Suppose W is a subspace invariant under a linear operator 
T : V -> V. Then T induces a linear operator T on WW defined by f(v + W) = 
T(v) + W. Moreover, if T is a zero of any polynomial, then so is T. Thus the mini- 
mum polynomial of T divides the minimum polynomial of T. 

We first show that f is well defined, i.e. if u+W = v + W then f(u+W) = f(v + W). If 
u + W = v + W then u - v G W and, since W is T-invariant, T^n - v) = r(it) - 7» G W. 
Accordingly, 

T(u+W) = r(«) + W = r(i>) + W = T(y + W) 

as required. 

We next show that t is linear. We have 

t ((u + W) + (v + W)) = f(u + v + W) = TOt + i^ + W = T(u) + T(v) + W 
= T(u) + W + T(v) + W = t(u + W) + f(v + W) 

and 

f (fc(it + W)) = f (ku + W) = T{ku) + W = kT(u) + W = k(T(u) + W) = kf(u+ W) 
Thus f is linear. 
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Now, for any coset u + W in V/W, 
T2( U + W) = T 2 (u) + W — T(T(u)) + W = f(T(u) + W) — f(f(u + W)) = T 2 {u + W) 

Hence T 2 — T 2 . Similarly T n = f" for any n. Thus for any polynomial 

f(t) = a n t» + - • • + a 0 = 2 afi, 

f{T)(u+W) = f(T)(u) + W = Sa^M) + W = S^r'M + w) 

- ^aiTHu + W) = ^apHu+W) = (Saflyu + W) = f(f)(u+W) 

and so J{T) - f(f). Accordingly, if T is a root of /(*) then f(T) = 0 = W - f(f), i.e. f is also a 
root of f(t). Thus the theorem is proved. 



10.28. Prove Theorem 10.1: Let T.V-+ V be a linear operator whose characteristic poly- 
nomial factors into linear polynomials. Then V has a basis in which T is represented 
by a triangular matrix. 

The proof is by induction on the dimension of V. If dim V = 1, then every matrix representa- 
tion of T is a 1 by 1 matrix which is triangular. 

Now suppose dim V = n > 1 and that the theorem holds for spaces of dimension less than n. 
Since the characteristic polynomial of T factors into linear polynomials, T has at least one eigen- 
value and so at least one nonzero eigenvector v, say T(v) — a n v. Let W be the 1-dimensional sub- 
space spanned by v. Set V = V/W. Then (Problem 10.26) dim V = dim V — dim W = n - 1. Note 
also that W is invariant under T. By Theorem 10.16, T induces a linear operator f on V whose 
minimum polynomial divides the minimum polynomial of T. Since the characteristic polynomial of 
T is a product of linear polynomials, so is its minimum polynomial; hence so are the minimum 
and characteristic polynomials of t. Thus V and f satisfy the hypothesis of the theorem. Hence, 
by induction, there exists a basis .... *„} of V such that 

f (C 2 ) = a 2i i> 2 

f(v 3 ) = a 32 C 2 + o 33 * 3 

Now let v 2 , . . . , v n be elements of V which belong to the cosets v 2 , ■ ■ • > *n respectively. Then 
{v,v 2 , ...,v n } is a basis of V (Problem 10.26). Since f(v 2 ) - a 22 v 2 , we have 

f(v 2 ) - a 22 v 2 = 0 and so T(v 2 ) - a 22 v 2 G W 

But W is spanned by v; hence T(v 2 ) - a 22 v 2 is a multiple of v, say 

T(v 2 ) — a 22 v 2 = a 21 v and so T(v 2 ) = a 2l v + a 22 v 2 

Similarly, for i = 3 n, 

T(v { ) - a i2 v 2 - a i3 v 3 - ■ ■ ■ - e W and so T(v t ) = a n v + a i2 v 2 + • • • + a a v t 

Thus T(v) - a n v 

T(v 2 ) = a 2l v + a 22 v 2 

T(v n ) = a nl v + a n2 v 2 + ■ • • + a nn v n 
and hence the matrix of T in this basis is triangular. 



CYCLIC SUBSPACES, RATIONAL CANONICAL FORM 

10.29. Prove Theorem 10.12: Let Z(v, T) be a T-cyclic subspace, T v the restriction of T to 
Z{v,T), and m v (t) = t k + a^it*- 1 + ■ ■ • + Oo the T-annihilator of v. Then: 

(i) The set {v, T(v), T"' 1 ^)} is a basis of Z(v,T); hence dimZ(v,T) = k. 

(ii) The minimal polynomial of T„ is m v (t). 

(iii) The matrix of T v in the above basis is 
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(i) 



(ii) 



C = 



0 0 0 
10 0 



0 
0 



-a 0 
-at 



0 0 0 
0 0 0 



0 

1 



"ftfc-2 
-Ctk-l 



By definition of ra„(t), T k (v) is the first vector in the sequence v, T(v), T 2 (v), . . . which is a 
linear combination of those vectors which precede it in the sequence; hence the set 
B — {v,T(v), . . ., r fc_1 (i;)} is linearly independent. We now only have to show that Z(v, T) = 
L(B), the linear span of B. By the above, T k (v) G L(B). We prove by induction that 
T n (v) G L(B) for every n. Suppose n > k and T n ~ l (v) G L(B), i.e. 7' n_1 (v) is a linear com- 
bination of v, . ..,T k ~ 1 (v). Then T*(v) = T(T n ~ l (v)) is a linear combination of T(v), . . .,T k (v). 
But T k (v) G L(B); hence T n {v) G L(B) for every n. Consequently f(T){v) G L(B) for any 
polynomial f(t). Thus Z(v, T) = L(B) and so B is a basis as claimed. 



Suppose m(t) = t s + b,-^*- 1 + 
v G Z(v, T), 



• + b 0 is the minimal polynomial of T v . Then, since 

0 - m(T v )(v) = m(T)(v) = T s (v) + 6 s _j !T*-i(v) + ■■■ + b 0 v 

Thus T s (v) is a linear combination of »,?(■»),..., r s_1 (ii), and therefore k — s. However, 
m„(T) = 0 and so m v (T v ) = 0. Then m(t) divides m B (t) and so s — k. Accordingly k = s and 
hence m„(t) = m(t). 



(iii) 



r»(r(*)) 



T(v) 



T2(v) 



T v (T k -*(v)) = T k ~Hv) 
T V (T«-Hv)) = T k (v) = -aov - a t T{v) - a 2 T2(v) a M P-'(t>) 

By definition, the matrix of T v in this basis is the transpose of the matrix of coefficients of the 
above system of equations; hence it is C, as required. 



10.30. Let T : V -* V be linear. Let W be a T-invariant subspace of V and f the induced 
operator on V/W. Prove: (i) The T-annihilator of v G V divides the minimal poly- 
nomial of T. (ii) The f-annihilator of v e V/W divides the minimal polynomial of T. 

(i) The T-annihilator of v G V is the minimal polynomial of the restriction of T to Z{v, T) and 
therefore, by Problem 10.6, it divides the minimal polynomial of T. 

(ii) The f-annihilator of G V/W divides the minimal polynomial of t , which divides the minimal 
polynomial of T by Theorem 10.16. 

Remark. In case the minimal polynomial of T is /(t)« where f(t) is a monic irreducible poly- 
nomial, then the T-annihilator of v G V and the T-annihilator of v G V/W are of the form f(t) m 
where m — n. 



10.31. Prove Lemma 10.13: Let T : V -» V be a linear operator whose minimal polynomial 
is /(*)" where f(t) is a monic irreducible polynomial. Then V is the direct sum of 
T-cyclic subspaces Zi = Z(vi, T), i = 1, . . ,,r, with corresponding T-annihilators 

/(«)"», /(«)"■, . . . , /(t)*, n = m ^ w 2 ^ • • • s» «r 

Any other decomposition of V into the direct sum of T-cyclic subspaces has the same 
number of components and the same set of T-annihilators. 

The proof is by induction on the dimension of V. If dim V = 1, then V is itself T-cyclic and 
the lemma holds. Now suppose dim V > 1 and that the lemma holds for those vector spaces of 
dimension less than that of V. 
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Since the minimal polynomial of T is /(*)", there exists ^GF such that /(J 1 )" -1 (v x ) 0; 
hence the T-annihilator of v 1 is /(*)". Let Z i = Z(v u T) and recall that Z l is T-invariant. Let 
V = V/Z l and let t be the linear operator on V induced by T. By Theorem 10.16, the minimal poly- 
nomial of T divides /(<)"; hence the hypothesis holds for V and T. Consequently, by induction, V is 
the direct sum of T-cyclic subspaces; say, 

V = Z(v 2 , f ) 0 • • • © Z(%, f) 

where the corresponding T-annihilators are f(t) n 2, . . . , f(t)"r, n — n 2 — ■ • • — n r . 

We claim that there is a vector v 2 in the coset v 2 whose T-annihilator is /(<)"=>, the T-annihilator 
of v 2 . Let w be any vector in v 2 . Then f(T) n * (w) £ Z t . Hence there exists a polynomial g(t) for 
which 

f(T)«* (w) = fir(r)K) (i) 

Since /(t)" is the minimal polynomial of T, we have by (1), 

0 = = /(D«-».flr(D(i'i) 

But /(«)« is the T-annihilator of v t ; hence /(*)" divides /(t) n_n * #(*) and so ff(*) = f(t)^h(t) for 
some polynomial We set 

v 2 = it, - fc(T) (Vi) 

Since w - « 2 = ( v i) e ^l. ^2 also belongs to the coset $2- Thus the T-annihilator of v 2 is a 
multiple of the T-annihilator of v 2 . On the other hand, by (1), 

f{T)**{v 2 ) = /(D«»(«-*(D(i>i)) = /(DM») - fWW = o 

Consequently the T-annihilator of v 2 is /(t) n 2 as claimed. 

Similarly, there exist vectors v z , . . . , v r 6 V such that u { 6 v[ and that the T-annihilator of 
tfj is fit)**, the T-annihilator of ^. We set 

z 2 = z(v 2 ,T), .... z r = z(v„r) 

Let d denote the degree of f(t) so that /(<)"• has degree Then since f(t) n i is both the T-annihilator 
of Vj and the f-annihilator of vl we know that 

{v it T(v % ), T*H- 1 (Vi)} and f pTj), . . . , f**- 1 (v { )} 

are bases for Z{v^ T) and Z(W it f) respectively, for i = 2,...,r. But V = Z(vl t) ® ■ • • ® 
Z(vZ, f); hence . 

{« 2 , . . . , r*h>-» (« 2 ), ...,*„..., r^r-i (* r )> 

is a basis for V. Therefore by Problem 10.26 and the relation f\v) = T*(v) (see Problem 10.27), 

{v lt . .., T^i-i( Vl ), v 2 , . .., Td"*-i(v 2 ), ...,v r T*«r-i(v r )} 

is a basis for V. Thus by Theorem 10.4, V = Z(v lr T) © • • • © Z(v r , T), as required. 

It remains to show that the exponents n t , ...,n r are uniquely determined by T. Since d denotes 
the degree of f(t), 

dim V = d(n t +•••+«,) and dim Z i = dn it i - 1, . . . , r 

Also, if s is any positive integer then (Problem 10.59) /(T) s (^i) is a cyclic subspace generated by 
f(Ty(v t ) and it has dimension d(nj-s) if « s > s and dimension 0 if w { - s. 

Now any vector ti£V can be written uniquely in the form v = w x + • • • + w r where w 4 £ Z 4 . 
Hence any vector in /(T) S (V) can be written uniquely in the form 

f(T)*(v) = + • • • + /(T)»(w r ) 

where /(T) s (w { ) e f(Ty(Z^. Let t be the integer, dependent on s, for which 

% t > s, n t > s, «t+i - s 

Then /(D«(V) = f(T)*(Zi) © ••• © /(T) s (Z t ) 

and so dim (/(D* 00) = — *) + + («* — «)] (*) 

The numbers on the left of (*) are uniquely determined by T. Set s = n - 1 and (*) determines the 
number of rc 4 equal to n. Next set s = m-2 and (*) determines the number of w 4 (if any) equal to 
« - 1. We repeat the process until we set s = 0 and determine the number of n { equal to 1. Thus 
the n ; are uniquely determined by T and V, and the lemma is proved. 
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10.32. Let V be a vector space of dimension 7 over R, and let T : V -» V be a linear operator 
with minimal polynomial m(t) = (t 2 + 2)(t + 3) 3 . Find all the possible rational 
canonical forms for T. 

The sum of the degrees of the companion matrices must add up to 7. Also, one companion 
matrix must be t 2 + 2 and one must be (t + 3) 3 . Thus the rational canonical form of T is exactly one 
of the following direct sums of companion matrices: 

(i) C(t 2 + 2) © C(t2 + 2) © C((i + 3)3) 

(ii) C(t 2 + 2) © C((t + 3)3) © C((t + 3)2) 

(iii) C(t 2 + 2) © C((t + m © C(t + 3) © C(t + 3) 



That is, 



0 -2 



(i) 



0 -27 

0 -27 

1 -9 



'0 -2 
0 




(ii) 



0 -27 

0 -27 

1 -9 



(iii) 



PROJECTIONS 

10.33. Suppose V = Wi © • • • © W r . The projection of V into its subspace W k is the map- 
ping E : V -* V defined by E(v) — Wk where v = wi + • • • + w r , Wi G Wi. Show 
that (i) E is linear, (ii) E 2 = E. 

(i) Since the sum v = w x + • • • + w r , w { G W is uniquely determined by v, the mapping E is well 
defined. Suppose, for u&V, u — w[ + • • • + w' r , w[ e W K . Then 

v + u = {w l + w' l )+ ■•• + (w r + wj) and kv = fcwj + ••• + kw r , kw i ,w i J rw' i £ 
are the unique sums corresponding to v + u and fcv. Hence 

E(v + u) = w fc + w' k - E(v) + E(u) and E(kv) = kw k = kE(v) 
and therefore E is linear. 

(ii) We have that w k = 0+---+0 + w fc + 0+ -- -+ 0 

is the unique sum corresponding to w k G W k ; hence E(w k ) = w k . Then for any t)£7, 

E 2 (v) - E(E(v)) - E(w k ) = w k = E{v) 
Thus E 2 = E, as required. 



10.34. Suppose E : V -» V is linear and E 2 = E. Show that: (i) E(u) = u for any u&lmE, 
i.e. the restriction of E to its image is the identity mapping; (ii) V is the direct sum 
of the image and kernel of E: V = ImE © Ker E; (iii) E is the projection of V into 
Im E, its image. Thus, by the preceding problem, a linear mapping T : V -* V is a 
projection if and only if T 2 = T; this characterization of a projection is frequently 
used as its definition. 

(i) If m G Im E, then there exists v G V for which E{v) = u; hence 

E(u) = E(E(v)) = EHv) - E(v) = u 

as required. 

(ii) Let v G y. We can write v in the form v = E(v) + v — #(i>). Now £"(t)) G ImE and, since 

(v - E(v)) = E(v) - E 2 (v) = E(v) - E(v) = 0 
v — E(v) G Ker E. Accordingly, V = Im E + Ker £7. 
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Now suppose w GlmE nKerE. By (i), E(w) = w because wElmE. On the other 
hand, E(w) — 0 because w G Ker 2?. Thus w = 0 and so ImBn Ker E = {0}. These two 
conditions imply that V is the direct sum of the image and kernel of E. 

(iii) Let t£V and suppose v = u + w where «elm£ and w> G Ker E. Note that S(w) = u 
by (i), and E(w) — 0 because to G Ker 2?. Hence 

2J(v) = E(u + w) - E(u) + E(w) = u + 0 = m 
That is, E is the projection of V into its image. 



10.35. Suppose V = U®W and suppose T : V •* V is linear. Show that U and W are both 
T-invariant if and only if TE = ET where E is the projection of V into U. 

Observe that E(v) G U for every v G V, and that (i) 2?(v) = v iff v G 17, (ii) E(v) = 0 
iff »£f. 

Suppose ET = TE. Let u G U. Since E(m) = u, 

r(w) = = (TE)(u) = (ET)(u) = B(r(w)) G 17 

Hence [7 is T-invariant. Now let w G W. Since = 0, 

E(T(w)) = (ET)(w) = (TE)(w) = T(E(w)) = T(0) = 0 and so r(w) G W 

Hence W is also T-invariant. 

Conversely, suppose U and W are both T-invariant. Let «£V and suppose v = w + w where 
u G T and weF. Then T(m) G t7 and T(w) G hence B(T(w)) = T(u) and E(r(w)) = 0. 

Thus (ET)(v) = (ET)(u + w) = (sdm + («r)(») = ff(r(u» + B(r(w» = r(«) 

and (TS)(o;) = (TE)(u + w) = T(E(u + w)) = T(u) 

That is, (ET)(v) - (TE)(v) for every v G V; therefore ET = as required. 



Supplementary Problems 

INVARIANT SUBSPACES 

10.36. Suppose W is invariant under T : V -> V. Show that W is invariant under /(T) for any polynomial 
/(*)■ 

10.37. Show that every subspace of V is invariant under / and 0, the identity and zero operators. 

10.38. Suppose W is invariant under S : V -* V and T : V -» V. Show that W is also invariant under 
S + T and ST. 

10.39. Let T : V -» V be linear and let W be the eigenspace belonging to an eigenvalue X of T. Show that 
W is T-invariant. 

10.40. Let V be a vector space of odd dimension (greater than 1) over the real field E. Show that any 
linear operator on V has an invariant subspace other than V or {0}. 



10.41. Determine the invariant subspaces of A — 




viewed as a linear operator on (i) R 2 , (ii) C 2 . 



10.42. Suppose dim V = n. Show that T : V -> V has a triangular matrix representation if and only 
there exist T-invariant subspaces W t c W 2 c • • • c W n = V for which dim W k - k, k - 1, . . . , 



INVARIANT DIRECT-SUMS 

10 43 The subspaces W lt ...,W r are said to be independent if w t + • • • + w r = 0, w i G implies that 
each « 4 = 0. Show that L(Wd = W t 0 ■ ■ • © W r if and only if the W { are independent. (Here 
L(Wi) denotes the linear span of the W^) 

10.44. Show that V = W t ® ■•• ® W r if and only if (i) V = L(W { ) and (ii) W k n L(W U . . .,W k - lt 
W k + 1 ,...,W r ) = {0}, k = 1, ...,r. 

10.45. Show that L(Wt) = ■■■ ®W r if and only if dim L(W t ) = dim PF X + • • • + dim W r . 
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10.46. Suppose the characteristic polynomial of T : V -» V is A(t) = / 2 (t)»* . . . / r (t)"r where the 
fi(t) are distinct monic irreducible polynomials. Let V = W t © • • • © W r be the primary decom- 
position of V into ^-invariant subspaces. Show that f t (t) n i is the characteristic polynomial of the 
restriction of T to W { . 

NILPOTENT OPERATORS 

10.47. Suppose S and T are nilpotent operators which commute, i.e. ST = TS. Show that S+T and ST 
are also nilpotent. 

10.48. Suppose A is a supertriangular matrix, i.e. all entries on and below the main diagonal are 0. Show 
that A is nilpotent. 

10.49. Let V be the vector space of polynomials of degree - n. Show that the differential operator on V 
is nilpotent of index n + 1. 

10.50. Show that the following nilpotent matrices of order n are similar: 

/0 1 0 ... 0\ /0 0 ... 



0 0 1 



1 0 
0 1 




and 

0 0 0 

\0 o o ... o/ \o 0 

10.51. Show that two nilpotent matrices of order 3 are similar if and only if they have the same index 
of nilpotency. Show by example that the statement is not true for nilpotent matrices of order 4. 

JORDAN CANONICAL FORM 

10.52. Find all possible Jordan canonical forms for those matrices whose characteristic polynomial A(t) 
and minimal polynomial m(t) are as follows: 

(i) A(t) - («-2)4(t-3)2, m(t) = (t-2) 2 (t-3) 2 

(ii) A(t) = (*-7)5, m(t) = (t-7) 2 

(iii) A(i) = (t-2) 7 , m(t) = (t-2)3 

(iv) A(t) = (t-8)*(t-5)*, m(t) = (t-S)*(t-5)* 

10.53. Show that every complex matrix is similar to its transpose. (Hint. Use Jordan canonical form and 
Problem 10.50.) 

10.54. Show that all complex matrices A of order n for which A n = I are similar. 

10.55. Suppose A is a complex matrix with only real eigenvalues. Show that A is similar to a matrix with 
only real entries. 

CYCLIC SUBSPACES 

10.56. Suppose T : V -* V is linear. Prove that Z(v, T) is the intersection of all ^-invariant subspaces 
containing v. 

10.57. Let fit) and g{t) be the T-annihilators of u and v respectively. Show that if f{t) and g(t) are rel- 
atively prime, then f(t)g(t) is the T-annihilator of w + v. 

10.58. Prove that Z(u, T) = Z(v, T) if and only if g(T)(u) - v where g(t) is relatively prime to the 
T-annihilator of u. 

10.59. Let W = Z(v, T), and suppose the T-annihilator of v is f(t) n where fit) is a monic irreducible poly- 
nomial of degree d. Show that f(T) s (W) is a cyclic subspace generated by f(T) s (v) and it has dimen- 
sion d(n — s) if n > s and dimension 0 if n — s. 

RATIONAL CANONICAL FORM 

10.60. Find all possible rational canonical forms for: 

(i) 6X6 matrices with minimum polynomial m(t) = (t 2 + 3)(t + l) 2 

(ii) 6X6 matrices with minimum polynomial m(t) = (t + l) 3 

(iii) 8 X 8 matrices with minimum polynomial m(t) — (t 2 + 2) 2 (t + 3) 2 

10.61. Let A be a 4 X 4 matrix with minimum polynomial m(t) = (t 2 + l)(t 2 — 3). Find the rational ca- 
nonical form for A if A is a matrix over (i) the rational field Q, (ii) the real field R, (iii) the com- 
plex field C. 
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10.62. Find the rational canonical form for the Jordan block 



j\ 1 o o\ 
0X10 
0 0X1 

\o 0 0 X/ 



10.63. Prove that the characteristic polynomial of an operator T : V -» V is a product of its elementary- 
divisors. 

10.64. Prove that two 3X3 matrices with the same minimum and characteristic polynomials are similar. 

10.65. Let C(f(t)) denote the companion matrix to an arbitrary polynomial f(t). Show that f(t) is the char- 
acteristic polynomial of C(/(t)). 

PROJECTIONS 

10.66. Suppose V = W t © • • • © W r . Let E x denote the projection of V into W t . Prove: (i) E f Ej = 0, 
i¥=j\ (ii) / = #!+••• +E r . 

10.67. Let E u .. .,E r be linear operators on V such that: (i) Ef = E u i.e. the E i are projections; 
(ii) E i E i = 0, i ¥> j; (iii) I = E 1 H +E r . Prove that V = ImBj © ■ • • © ImE r . 

10.68. Suppose E : V -> V is a projection, i.e. = 2?. Prove that E has a matrix representation of the 
form ^* ^ where r is the rank of E and 7 r is the r-square identity matrix. 

10.69. Prove that any two projections of the same rank are similar. (Hint. Use the result of Problem 
10.68.) 

10.70. Suppose E : V -» V is a projection. Prove: 

(i) /- E is a projection and V = ImE © Im {I-E); (ii) I + E is invertible (if 1 + 1^0). 

QUOTIENT SPACES 

10.71. Let W be a subspace of V. Suppose the set of cosets {vj + W, v 2 + W, ...,v n +W} in V/W is 
linearly independent. Show that the set of vectors {v u v 2 , . . . , v n } in V is also linearly independent. 

10.72. Let IF be a subspace of V. Suppose the set of vectors {w„ w 2 uj in V is linearly independent, 

and that L(u { ) n W = {0}. Show that the set of cosets {u t + W, . . . , u n + W} in V/W is also 
linearly independent. 

10.73. Suppose V = U © W and that {u u ...,«„} is a basis of U. Show that 0^ + W, . . ., u n + W} is 
a basis of the quotient space V/W. (Observe that no condition is placed on the dimensionality of 
V or W.) 

10.74. Let W be the solution space of the linear equation 

djXj + a 2 %2 + • • • + Mn = 0, a t S K 

and let v - (b lt b 2 6 n ) € K". Prove that the coset v + W of W in K« is the solution set of the 

linear equation + + . . . + ana . n = 5 where 6 = a 1 6 1 + • • • + a n b n 

10.75. Let V be the vector space of polynomials over R and let W be the subspace of polynomials divisible 
by t*, i.e. of the form a 0 i 4 + a^ 5 + 1- a„- 4 i n . Show that the quotient space V/W is of dimension 4. 

10.76. Let U and W be subspaces of V such that W C J7 C V. Note that any coset u+WoiWinU may 
also be viewed as a coset of W in V since a£P implies u 6 V; hence t//W is a subset of WW. 
Prove that (i) U/W is a subspace of V/FP, (ii) dim(V/W) - dim (U/W) = &im(V/U). 

10.77. Let U and W be subspaces of V. Show that the cosets of U n W in V can be obtained by inter- 
secting each of the cosets of U in V by each of the cosets of W in V: 

V/(Un W) = {(v + U) n (v' +W): v, v' e V} 

10.78. Let T : V -* V be linear with kernel W and image U. Show that 
the quotient space V/W is isomorphic to U under the mapping 
e : V/W -» 17 denned by fl(i> + = Furthermore, show that 
T = i o 0 o tj where ij : V -* V/W 7 is the natural mapping of V into 
V/W, i.e. i?(t>) = v + W, and i : U c V is the inclusion mapping, y/;y 
i.e. i(u) = u. (See diagram.) 
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Answers to Supplementary Problems 

10.41. (i) R2 and {0} (ii) C*, {0}, = L((2, 1 - 2i)), W 2 = L((2, 1 + 2t)) • 



10.52. (i) 



2 1 | 
2 I 



\ 



I 2 1 



I 



2 I 



\ 



/ 



/'i 



— tii_ 



\ 



LlL 



3 1 



(ii) 



7 1 



7 I 



7 1 



I — -I 

I 7 / 



111 



7 i 



(iii) 



2 1 
2 1 
2 



2 1 
2 1 



I 2 



7 



2 1 
2 1 
2 



2 1 



\ 



l_£4 — 

12 1/ 
I 2 / 



2 1 I 
2 1 I 
2 I 

r 



\ 



i 



2 1 I 
2 I 



T i" 1 



2 1 I 
2 1 I 
2 I 



\ 



i 2 1 

I 2 



(iv) 



3 1 I 
__3 I 

I 



3 1 I 



3 I 



I 



\ 

I 5 1 
I 5 



3 1 I 



3 1 I 
3 I 



I 



5 1 | 
L 5 ' 

r t-i 



3 1 I 
3 I 



L 3 _l 



LLL 



5 1 



i 



4 

I 5 1 



3 1 I 
3 I 

"--i-r-i 



L - 1- - 1 



L 3 1 

L 5 _ 
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10.60. (i) 



(ii) 



(iii) 




10.61. (i) 




(ii) 



/0 -1 



-Vs/ 



(iii) 



—i 



10.62. 



(0 0 0 -\ 4 \ 

1 0 0 4\3 

0 10 -6\ 2 

\0 0 1 4X / 



Chapter 11 



Linear Functionals and the Dual Space 

INTRODUCTION 

In this chapter we study linear mappings from a vector space V into its field K of scalars. 
(Unless otherwise stated or implied, we view K as a vector space over itself.) Naturally 
all the theorems and results for arbitrary linear mappings on V hold for this special case. 
However, we treat these mappings separately because of their fundamental importance and 
because the special relationship of V to K gives rise to new notions and results which do not 
apply in the general case. 



LINEAR FUNCTIONALS AND THE DUAL SPACE 

Let V be a vector space over a field K. A mapping 4, : V -» K is termed a linear func- 
tional (or linear form) if, for every u, v G V and every a,b GK, 

<j>(au + bv) — a<f,(u) + b<j>(v) 

In other words, a linear functional on V is a linear mapping from V into K. 

Example 11.1: Let v t : K n -* K be the ith projection mapping, i.e. ir^a^ a 2 , . . . , o„) = a 4 . Then ttj 
is linear and so it is a linear functional on K n . 

Example 11.2: Let V be the vector space of polynomials in t over R. Let JJ : V -» R be the integral 

operator denned by ^(p(t)) = I p(t) dt. Recall that $ is linear; and hence it is 
a linear functional on V. o 

Example 11.3: Let V be the vector space of w-square matrices over K. Let T : V -» K be the trace 
mapping 

T(A) = a n + o 22 + • • ■ + a nn , where A = (o (j ) 

That is, T assigns to a matrix A the sum of its diagonal elements. This map is 
linear (Problem 11.27) and so it is a linear functional on V. 

By Theorem 6.6, the set of linear functionals on a vector space V over a field K is also 
a vector space over K with addition and scalar multiplication defined by 

(4> + o-)(v) = (f>(v) + o-(v) and {k<j>)(v) = k<j>(v) 

where <j> and a are linear functionals on V and k G K. This space is called the dual space of 
V and is denoted by V*. 

Example 11.4: Let V = K n , the vector space of m-tuples which we write as column vectors. Then 
the dual space V* can be identified with the space of row vectors. In particular, 
any linear functional <f> = K, . . . , a n ) in V* has the representation 

<t>(x x , . . . , x n ) — (a u a 2 , . . . , a n )\ 
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or simply 

«„) = + <h, x Z + • • • + »A 

Historically, the above formal expression was termed a linear form. 



DUAL BASIS 

Suppose V is a vector space of dimension n over K. By Theorem 6.7, the dimension of 
the dual space V* is also n (since K is of dimension 1 over itself.) In fact, each basis of V 
determines a basis of V* as follows: 

Theorem 11.1: Suppose {Vi, . . .,v n } is a basis of V over K. Let ^, . . G V* be the 
linear functionals denned by 

M = 8 "' = jo if i + i 

Then {4> l( . . .,<j>J is a basis of V*. 

The above basis {^} is termed the basis dual to {Vi} or the dual basis. The above for- 
mula which uses the Kronecker delta 8 y is a short way of writing 

^(vj = i, = o, ^og = 0, .... ^ t (v„) - 0 
= 0, 4> 2 (« 2 ) = 1. <M*> 3 ) = 0, • • •» <M*0 = 0 

*„(»,) = o. = °» = o. *.(«■) = 1 

By Theorem 6.2, these linear mappings 0, are unique and well defined. 

Example 11.5: Consider the following basis of R 2 : {fj = (2,1), v 2 = (3,1)}- Find the dual basis 
We seek linear functionals <f>i(x, y) = ax + by and 0 2 (*> ») = «* + dy such that 

^(v,) = 1, 0i(v 2 ) = 0, * 2 (vi) = 0, 0 2 (v 2 ) = 1 



Thus 



0l(Vl) 


= #i(2,l) 


= 2a + b = 




01(" 2 ) 


= <M3,1) 


= 3a + b = 


0 




= * 2 (2,D 


= 2c + d = 


0 




= ^2(3,1) 


- 3c + d = 





or a = —1, 6 = 3 



or c = 1, d = — 2 

Hence the dual basis is {<t>i(x,y) = — * + 3j/, 0 2 (*,J/) = x-2y}. 
The next theorems give relationships between bases and their duals. 

Theorem 11.2: Let {vi, . . . , v n ) be a basis of V and let ...,.£„} be the dual basis of V*. 
Then for any vector uGV, 

u = <t >l {u)v 1 + 4> 2 (u)v 2 + • • • + 
and, for any linear functional a G V*, 

or = ^V,)^ + <r(v 2 )£ 2 + • • • + 

Theorem 11.3: Let {vi, ...,«»} and {wi, ...,w n ) be bases of V and let ...,<£„} and 
{o-j, . . . , o-J be the bases of V* dual to and {wi} respectively. Suppose 
P is the transition matrix from to {wi}. Then (P" 1 )' is the transition 
matrix from {0j to {o-.}. 
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SECOND DUAL SPACE 

We repeat: every vector space V has a dual space V* which consists of all the linear 
functionals on V. Thus V* itself has a dual space V**, called the second dual of V, which 
consists of all the linear functionals on V*. 

We now show that each v G V determines a specific element v G V**. First of all, 
for any </> G V* we define A 

v(4>) = <l>(v) 

It remains to be shown that this map v : V* -> K is linear. For any scalars a, 6 G K and 
any linear functionals <j>,<r G V*, we have 

v{a4> + b<j) = (a<f> + b<r)(v) = a <f,(v) + b a(v) = a + & v(<r) 
That is, v is linear and so vGV**. The following theorem applies. 

Theorem 11.4: If V has finite dimension, then the mapping v is an isomorphism of V 
onto V**. 

>\ 

The above mapping v h» v is called the natural mapping of F into 7**. We emphasize 
that this mapping is never onto V** if V is not finite-dimensional. However, it is always 
linear and, moreover, it is always one-to-one. 

Now suppose V does have finite dimension. By the above theorem the natural mapping 
determines an isomorphism between V and V**. Unless otherwise stated we shall identify 
V with V** by this mapping. Accordingly we shall view V as the space of linear functionals 
on V* and shall write V = V**. We remark that if {<£.} is the basis of V* dual to a basis 
{Vi} of V, then {Vi} is the basis oi V = V** which is dual to {<£,}. 



ANNIHILATORS 

Let W be a subset (not necessarily a subspace) of a vector space V. A linear functional 
<j> G V* is called an annihilator of W if <j>(w) = 0 for every w G W, i.e. if <j>(W) = {0}. 
We show that the set of all such mappings, denoted by W° and called the annihilator of W, 
is a subspace of V*. Clearly 0 G W°. Now suppose <£, o G IF 0 . Then, for any scalars 
a,b GK and for any w G W , 

{a$ + b<T){w) = a<j>{w) + bv{w) - aO + bO = 0 
Thus a^ + baG W° and so W° is a subspace of V*. 

In the case that IF is a subspace of V, we have the following relationship between W and 
its annihilator W°. 

Theorem 11.5: Suppose V has finite dimension and W is a subspace of V. Then 
(i) dim W + dim W° = dim V and (ii) W 00 = W. 

Here W 00 = {v G F : = 0 for every ^ G IF 0 } or, equivalently, IF 00 = (IF 0 ) 0 where 
IF 00 is viewed as a subspace of F under the identification of F and F**. 

The concept of an annihilator enables us to give another interpretation of a homogeneous 
system of linear equations, 

auXi + ai 2 #2 + • • • + ainXn = 0 

a 21 xi + a 22 x 2 + • ■ • + a 2 nX n =0 . , 



a m lXi + Om 2 X 2 + • ■ • + amnXn — 0 
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Here each row (an, a&, . . ., a ir ) of the coefficient matrix A = (ay) is viewed as an element 
of K" and each solution vector <j> = (xi, X2, . . . , x n ) is viewed as an element of the dual space. 
In this context, the solution space S of (*) is the annihilator of the rows of A and hence of 
the row space of A. Consequently, using Theorem 11.5, we again obtain the following 
fundamental result on the dimension of the solution space of a homogeneous system of 
linear equations: 

dimS = dimK n - dim (row space of A) = n — rank (A) 

TRANSPOSE OF A LINEAR MAPPING 

Let T :V -* U be an arbitrary linear mapping from a vector space V into a vector space 
U. Now for any linear functional ^ G U*, the composition <j> o T is a linear mapping from 
V into K: 




That is, <j>° T G V*. Thus the correspondence 

is a mapping from U* into V*; we denote it by T* and call it the transpose of T. In other 
words, T*:U*-* V* is denned by 

Thus (2 ,t (^,))(v) = <f>(T(v)) for every v € V. 

Theorem 11.6: The transpose mapping T* denned above is linear. 

Proof. For any scalars a,b G K and any linear functionals <$>, a G f/*, 
T'iM + bfr) = (a<j> + b<r)oT = a(</>°T) + b(aoT) 
= a T*(<j,) + b T<( a ) 

That is, T l is linear as claimed. 

We emphasize that if T is a linear mapping from V into U, then T* is a linear mapping 
from U* into V*: T Tt 

The name "transpose" for the mapping T* no doubt derives from the following theorem. 

Theorem 11.7: Let T : V -» U be linear, and let A be the matrix representation of T rel- 
ative to bases {Vi} of V and {im} of C/. Then the transpose matrix A* is 
the matrix representation of T*:U*-* V* relative to the bases dual to 
{Mi} and {Vi}. 
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Solved Problems 

DUAL SPACES AND BASES 

11.1. Let 4> : R 2 -* R and a : R 2 -» R be the linear f unctionals denned by <f>(x, y) = x + 2y 
and <t(x, y) - 3x- y. Find (i) 4. + <r, (ii) 4<£, (iii) 2<f> - 5<r. 

(i) (<f> + a)(x,y) = i>(x, y) + o(x, y) = x + 2y + Zx - y = 4x + y 

(ii) (40) (x, y) = 4<t>(x,y) = 4(x + 2y) = 4x + Sy 

(iii) (2$~5 a )(x,y) = 2 0(b, y) - 5 <r(x, y) = 2(x + 2y) - 5(3* - y) = -13* + 9y 



11.2. Consider the following basis of R 3 : {vi = (1,-1,3), v 2 = (0,1,-1), vs = (0,3,-2)}. 
Find the dual basis {<f> v <£ 2 , <f> 3 ). 

We seek linear functionals 

4>i(«, v, «) = + o-2V + «3 Z » <t>2( x , v< z ) = M + hv + h z > v, «) = «i* + C 2V + e s z 

such that sMfi) = 1 0iO>2) = 0 01(^3) = 0 

0 2 («i) = 0 <p 2 (v 2 ) = 1 <f> 2 (v s ) = 0 

08(%) = 0 <*3(*>2> = 0 0 3 (l> 3 ) = 1 

We And 0j as follows: 

0iK) = 0i(l> -1. 3) = ai - » 2 + 3a 3 = 1 
0i(v 2 ) = 0 1 (O,1,-1) = a 2 - a 3 = 0 

0i(«s) = 0i(O, 3, -2) = 3o 2 - 2a 3 = 0 
Solving the system of equations, we obtain <*! = 1, a 2 = 0, o 3 = 0. Thus <l>i(x, y, z) = x. 
We next find 0 2 : 

0 2 (<>i) = 02(1.-1.3) = 6j-6 2 +3b 3 = 0 
0 2 K) = 0 2 (O,1,-1) = 6 2 - 6 3 = 1 

02(^3) = 0 2 (O,3,-2) = 36 2 - 26 3 = 0 
Solving the system, we obtain b 1 = 7, b 2 = —2, 6 3 — —3. Hence </> 2 (x, y, z) — Ix — 2y — 3z. 
Finally, we find 0 3 : 

0 3 («i) = 0 3 (1. -1. 3) = Cj - c 2 + 3c 3 = 0 
03(^2) = 03(0, 1, -1) = c 2 - c 3 = 0 

0 3 (v 3 ) = 0 3 (O,3,-2) = 3c 2 -2c 3 = 1 
Solving the system, we obtain Cj = —2, c 2 = 1, c 3 = 1. Thus 0 3 (x, y, z) = —2x + y + z. 



11.3. Let V be the vector space of polynomials over R of degree — 1, i.e. V = 
{a + bt: a,b £ R}. Let ^V-^R and <£ 2 :V-»R be denned by 

= Cf(t)dt and 4> 2 (f(t)) = Cf{t)dt 

(We remark that ^ and <f> 2 are linear and so belong to the dual space V*.) Find the 
basis {vi, v 2 } of V which is dual to {<f> v <£ 2 }. 

Let v l = a + bt and v 2 = c + dt. By definition of the dual basis, 

0!(vj) = 1, 02(1;!) = 0 and 0i(v 2 ) = 0, <fi 2 (v 2 ) = 1 

Thus r i 

01 ( Vl ) = I ( a +bt)dt = « + = 1 

•'n 



0 2 (' 



*i) = f (a 



+ bt) dt = 2a + 2b = 0 



or a = 2, 6 = -2 
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or c = —A, d = 1 



0j( V2 ) = f (c + dt)dt - c + ^d - 0 

02(f 2 ) = f (e + dt) dt = 2c + 2d = 1 
In other words, {2 — 2t, + t} is the basis of V which is dual to {</> it <f> 2 }. 

11.4. Prove Theorem 11.1: Suppose {vi, . . . , v n ) is a basis of V over K. Let </>j, . . . , <j> n G V 
be the linear functionals denned by 

fl if i = 3 

d>.(v.) = S.. - < 

Then {<j> v . . . , <£ n } is a basis of V*. 

We first show that {<j> lt ...,*„} spans V*. Let 0 be an arbitrary element of V*, and suppose 

= *1, *(« 2 ) = fc 2» • • •. <t>( v n) = & n 

Set <r = Ml + • • • + k n <f, n . Then 

<r(l>l) = (Ml + • • • + Mn)("i) 

= ^^Vx) + fc 2 02(1;!) + • • • + K 0„(Vj) 

= fc x «l + fc 2 «0 + ••• + fc„'0 = fcj 

Similarly, for i = 2, . . . , «, 

<r(«i) = (Mi + • • • + 

= M>l( v i) + ••• + fci0i(v 4 ) + ••• + fc„0n(^i) = h 

Thus 0(0)1) = a(v { ) for i = 1 n. Since 0 and a agree on the basis vectors, $ = o = 

Mi + • • • + fc n <(>rr Accordingly, {<f> u ...,<(>„} spans V*. 

It remains to be shown that {<p v . . . , <p n } is linearly independent. Suppose 

atfi + a 2 0 2 + • • • + a„<t> n ' 0 
Applying both sides to v u we obtain 

0 = 0(170 = (a,0! + • • • + a n 0 n )(v!) 

= aj^^i;!) + O2 0 2 ( v l) + ••• + *»0n( v l) 

= ai • 1 + a 2 • 0 + ■ • • + a n • 0 = »! 

Similarly, for i = 2, . . . , n, 

0 = 0(u 4 ) = (Oj0i + • • • + a n 0„)(v ; ) 

= »1 0l(Vi) + • • • + «i #i(Vj) + • • • + <*n 0n( v i) = a i 

That is, aj = 0, . . .,a n = 0. Hence { 01 </> n ) is linearly independent and so it is a basis of V*. 

11.5. Prove Theorem 11.2: Let {v u ...,«„} be a basis of V and let {<j> v ...,<£„} be the dual 
basis of V*. Then, for any vector u G V, 

u = 4> 1 (u)v 1 + 4> 2 (u)v 2 + • • • + 4>>K, CO 

and, for any linear functional a G V*, 

a = dvfa + «(v 2 )<l> 2 + ■■■ + <j(v n )<i> n (2) 

Suppose u — a 1 v l + a 2 v 2 + • • • + a n v n (5) 

Then 

01 ( M ) = a 1 0 1 (v 1 ) + o 2 0j(v 2 ) + •■• + «„#i(««) = o, t -l + a 2 -0 + ••• + a n -0 = aj 
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Similarly, for i = 2, . . . , n, 

<j>i(u) = a t 0 i (u 1 ) + • • • + Oj ^(vi) + ■■■ + a n ^{v n ) = ttj 
That is, 0j(m) = a u 0 2 (w) = a 2 , <fi n (u) = a„. Substituting these results into (3), we obtain (1). 
Next we prove (2). Applying the linear functional a to both sides of (1), 
o(u) = 0 1 (m)<t(v 1 ) + </> 2 (u)o(v 2 ) + ••• + 0„(w) <r{v„) 
= a(Vj) 0j(tt) + <j(v 2 ) 0 2 (it) + • ■ • + o(v n ) <p n (u) 
= (<r(t>i)*i + <j(v 2 )<f> 2 + • • - + o{v„)<f> n )(u) 
Since the above holds for every uGV, a — <rO>i)0i + o(v 2 )<p 2 + • • • + o(v n )<j> n as claimed. 



11.6. Prove Theorem 11.3: Let {v u ...,v„} and {w u ...,w n } be bases of V and let 
{<t> lf ■ ■ ■ , <j> n } and {a v . . . , a n } be the bases of V* dual to {vi} and {wi} respectively. 
Suppose P is the transition matrix from {vt} to {wi}. Then (P -1 )' is the transition 
matrix from {^J to {o-J. 

Suppose 

w l - «11 V 1 + «12«2 + • • ■ + d ln V n a, = ftj^j + 6 12 0 2 + • • ■ + &i„0„ 

w 2 = a 2l v t + a 22 v 2 + • ■ • + a 2n v n a 2 = 6 2 i0i + b 22 4> 2 + • ■ • + 6 2n 0 n 



w n - a„i^i + a n2 v 2 + • ■ • + a nn v„ a n = b nl ^ + & n2 0 2 + ■■■ + & nn 0„ 

where P = (a i} ) and Q = (by). We seek to prove that Q = (P" 1 )*. 

Let Ri denote the ith row of Q and let Cj denote the jth column of P l . Then 
R i = *> i2 , . .., b ln ) and C s = (a jv a j2 , . . ., a jn ) f 
By definition of the dual basis, 

<ri(w,) - (6 U 0! + 6 i2 0 2 + • • • + + a j2 v 2 + ■■■ + a jn v n ) 

= b n a }1 + b a a i2 + • • • + b in a jn = R t C } - 8^ 
where S is is the Kronecker delta. Thus 

/flA «,C 2 ... R n C n \ 1 1 0 ... 

Qpt — [ R2C1 RiC 2 . . . R 2 C n _ 0 1... 

\B n Ci R n C 2 ... RjcJ \0 0 ... 

and hence Q = (P«)-i = (P~i)t as claimed. 



= 7 



11.7. Suppose V has finite dimension. Show that if v G V, v # 0, then there exists 
<t> e V* such that <f>(v) ¥° 0. 

We extend {v} to a basis {v,v 2 v n } of V. By Theorem 6.1, there exists a unique linear 

mapping 0 : V -* K such that <p(v) - 1 and 0(1^) = 0, i = 2, ...,n. Hence 0 has the desired 
property. 



11.8. Prove Theorem 11.4: If V has finite dimension, then the mapping v v is an 
isomorphism of V onto V**. (Here v : V* -* K is defined by v(<j>) = <j>(v).) 

We first prove that the map v h> -y is linear, i.e. for any vectors D,»e V and any scalars 
a, 6 E jK", av+Tra; = a? + bw. For any linear functional 0 G V*, 

oS"+T>w (0) = 0(01; + 6w) = a <t>(v) + b 0(w) 

= 0^(0) + 6w(0) = {av + 6w)(0) 

Since SiT+~Sw(0) = (av + 6w)(0) for every 0 G V*, we have SlT+Tw = 6w. Thus the 

map v l-» v is linear. 
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Now suppose v G V, v 0. Then, by the preceding problem, there exists 0 G V* for which 
0(v) # 0. Hence v (0) - </,(v) # 0 and thus v 0. Since v # 0 implies v # 0, the map v H- 1> 
is nonsingular and hence an isomorphism (Theorem 6.5). 

Now dim V = dim V* = dim V** because V has finite dimension. Accordingly, mapping v v 
is an isomorphism of V onto V**. 



ANNIHILATORS 

11.9. Show that if G V* annihilates a subset S of V, then <j> annihilates the linear span 
L(S) of S. Hence S° = (L(S))°. 

Suppose v G L(S). Then there exist w u . . ., w r G S for which v = + a 2 w 2 + ■ ■ • + a r w r . 

0(u) = a!0(w!) + a 2 0(w 2 ) + ••• + a r </>(w r ) = c^O + a 2 0 + ■ • • + a r 0 = 0 
Since v was an arbitrary element of L(S), <j> annihilates L(S) as claimed. 



11.10. Let W be the subspace of R 4 spanned by vi = (1, 2, -3, 4) and w 2 = (0, 1, 4, -1). Find 
a basis of the annihilator of W. 

By the preceding problem, it suffices to find a basis of the set of linear functional <f{x, y, z, w) = 
ax + by + cz + dw for which </>(v t ) = 0 and <fi(v 2 ) = 0: 

0(1,2,-3,4) = a + 2b-3c + U = 0 
0(0,1,4,-1) = 6 + 4c-d = 0 

The system of equations in unknowns a, b, c, d is in echelon form with free variables c and d. 

Set c = 1, d = 0 to obtain the solution a = 11, 6 = -4, c = 1, d = 0 and hence the linear func- 
tional 0i(«, J/, z, w) — llx — 4y + z. 

Set c = 0, d = -1 to obtain the solution a = 6, 6 = -1, e = 0, d = -1 and hence the linear func- 
tional 0 2 («, V,z,w) = 6a; — j/ — w. 

The set of linear functionals {<t> lt 0 2 } is a basis of W°, the annihilator of W. 



11.11. Show that: (i) for any subset S of V, S CS 00 ; (ii) if Si CS 2 , then S° cS?. 

(i) Let vGS. Then for every linear functional 0 G S°, v(0) = 0(v) = 0. Hence i> G (S°)°. 
Therefore, under the identification of V and V**, t)E S 00 . Accordingly, S C S 00 . 

(ii) Let 0GS 2 . Then 0(v) = 0 for every v G S 2 . But SiCS 2 ; hence 0 annihilates every ele- 
ment of S v i.e. 0 G Si- Therefore SiJcSj. 



11.12. Prove Theorem 11.5: Suppose V has finite dimension and W is a subspace of V. 
Then (i) dim W + dim W° = dim V and (ii) TP 00 = W. 

(i) Suppose dim V = n and dim = r ^ «. We want to show that dim W° = n - r. We choose 

a basis {w 1( . . . , w T } of W and extend it to the following basis of V: {w u *>n-r>- 

Consider the dual basis , x 

101, . . . , 0 r , ff 1( . . . , CT n - r > 

By definition of the dual basis, each of the above <r's annihilates each w { ; hence 

ffl a n _ r e W<>. We claim that {a^} is a basis of W°. Now {a,} is part of a basis of V* and 

so it is linearly independent. 

We next show that {oj} spans W°. Let <r G By Theorem 11.2, 

a — a(Wi)0i + • • • + <r(M> r )0 r + a(t>i)ffi + ■ • • + a(t> n _ r )a B _ r 

- 00! + • • • + O0 r + <A?>j)oi + ■ • ■ + <K"«-rVn-r 

= °{V\)<>\ + • • • + a(v„_ r )or B _ r 

Thus {<r t <r n -r} spans TP 0 and so it is a basis of W°. Accordingly, dim W° = n — r = 

dim V — dim W as required. 

(ii) Suppose dimV^w and dim W = r. Then dim V* = n and, by (i), dimTF° = w-r. Thus 
by (i), dim W 00 = «-(«- r) = r; therefore dim W - dim W 00 . By the preceding problem, 
W C W 00 . Accordingly, W = W 00 . 
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11.13. Let U and W be subspaces of V. Prove: (U + W) a = U° n W°. 

Let 0G (U+W) 0 . Then 0 annihilates U+W and so, in particular, 0 annihilates U and V. 
That is, 0 G L/o and 0 G W°\ hence 0 G [7« n W™. Thus (E7 + IF) 0 C C/° n W°. 

On the other hand, suppose a G t/° n Then a annihilates 17 and also W. If t£ U+W, 
then v = m + w where w G U and w G W. Hence cr(v) = o(u) + <r(w) = 0 + 0 = 0. Thus a 
annihilates U+W, i.e. a G (17 + PP)°. Accordingly, U° + JF» c (U + W)°. 

Both inclusion relations give us the desired equality. 

Remark: Observe that no dimension argument is employed in the proof; hence the result holds 
for spaces of finite or infinite dimension. 



TRANSPOSE OF A LINEAR MAPPING 

11.14. Let <j> be the linear functional on R 2 defined by ^(a;, y) = x - 2y. For each of the 

following linear operators T on R 2 , find (T*(t))(x, y): (i) T(x, y) = (x, 0); (ii) T(x,y) = 

(V, x + y); (iii) T(x, y) = (2x - Sy, 5x + 2y). 

By definition of the transpose mapping, T ( (0) = ^3 1 , i.e. (Tt($))(v) = <f>(T(v)) for every 
vector v. Hence 

(i) (T^))(x, y) = <fi(T(x,y)) = ^(x,0) = x 

(ii) (TK<f)){x,y) = <p(T(x,y)) = ^(y,x + y) = y - 2(x + y) = -2x - y 

(iii) (Tt(<f,))(x, y) = <f,(T(x,y)) = <t>(2x - 3y, 5x + 2y) = (2x - 3y) - 2(5x + 2y) = ~ix-ly. 



11.15. Let T : V ■* U be linear and let T* : U* -> V* be its transpose. Show that the kernel 
of T< is the annihilator of the image of T, i.e. Ker T = (Im T)°. 

Suppose 0 G Ker T«; that is, T*(<t>) - <j> o T = 0. If u G Im T, then u = T(v) for some 
v € V; hence 

0(u) = <fi(T(v)) = (4>°T)(v) = 0(v) = 0 

We have that <p(u) = 0 for every u G Im T; hence 0 G (Im T)°. Thus Ker T' C (Im T)°. 

On the other hand, suppose a G (Im T)°; that is, a(Im T) = {0}. Then, for every v 67, 

(T*(<,))(v) = ( a oT)(v) = a(T(v)) = 0 = 0(v) 

We have that (r*(<r))(v) = 0(v) for every n€V; hence Z^a) = 0. Therefore <r G Ker T* and so 
(Im T)<> c Ker T'. 

Both inclusion relations give us the required equality. 



11.16. Suppose V and U have finite dimension and suppose T : V -> U is linear. Prove: 
rank(r) = rank (7/'). 

Suppose dim V = n and dim U = m. Also suppose rank (T) = r. Then, by Theorem 11.5, 
dim ((Im T)°) = dim U - dim (Im T) = m - rank (r) = m - r 

By the preceding problem, Ker T* = (Im T)°. Hence nullity (7 1 ') = m — r. It then follows that, as 
claimed, 

rank(T«) = dim U* - nullity (7"') = m-(m-r-) = r = rank (T) 



11.17. Prove Theorem 11.7: Let T:V-*U be linear and let A be the matrix representation 
of T relative to bases {vi, . . ., v m } of V and {ui, . ..,Un} of U. Then the transpose 
matrix A* is the matrix representation of T' : Z7* -» V* relative to the bases dual to 

{tii} and {>j}. 

Suppose T(Vj) = a n Mi + a 12 u 2 + • • • + a ln u n 

T(v 2 ) = a 21 u x + a^u^ + •■■ + a 2n w„ 



T(v m ) = a OTlMl + a m2 tt 2 + • • • + a m „tt„ 
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We want to prove that _ . , , . 

rHo-i) = + a 21*2 + - • - + a-ml^m 

TK^il — «1201 + a 2202 + ' • • + «m20m 



(2) 



TK"n) — a ln4>l + <Hn<p2 + " * " + Mm 

where {itJ and {<f>j} are the bases dual to {itj and {vj} respectively. 

Let v £ V and suppose v = fc^ + k 2 v 2 + ■■■ + k m v m . Then, by (1), 
T(v) = fej T( Vl ) + k 2 T(v 2 ) + ■■■ + k m T(v m ) 

= fc 1 (a 11 M 1 + • • • + a ln U n ) + fc 2 (o 2 l M l + * • • + a 2 n M n) + " " " + fc m( a ml M l + ' - * + <W») 

= (fcittn + fc 2 a 21 + • • • + fe m a ml )M! + • • - + (k^n + k 2 a 2n + • • • + k m a mn )u n 

n 

= 2 + fc2 ffl 2i + • • • + fc m a mj )Mi 

Hence for j = 1, . . . , «, 

(r'(ffj)(v)) = <rj(2») = _2 (Mii + fc 2 a 2i + t-fc m a m i)«i^ 

= fctatj + k 2 a 2j + ••• + fc TO a m j ( 3 ) 

On the other hand, for j = 1, . . . , n, 

(a u ^ + a 2j 4> 2 + • • • + a mj 4, m )(v) = (aj^! + a 2j 0 2 + • • • + a mj 0 m )(Vi + k 2 v z +•■•+ K v m) 

= k^y + fe 2 a 2j + • • • + k m a mj (&) 
Since v&V was arbitrary, (3) and (b) imply that 

r*(<rj) = a^i + a 2j * 2 + ' • • + a mj <P m , i = 1. • • •> n 
which is (2). Thus the theorem is proved. 

11.18. Let A be an arbitrary m x n matrix over a field Prove that the row rank and the 
column rank of A are equal. 

Let T :K«->K™ be the linear map defined by T(v) = Av, where the elements of K" and K m 
are written as column vectors. Then A is the matrix representation of T relative to the usual bases 
of K n and K m , and the image of T is the column space of A. Hence 

rank (T) = column rank of A 
By Theorem 11.7, A* is the matrix representation of T* relative to the dual bases. Hence 

rank (T ( ) = column rank of A* - row rank of A 
But by Problem 11.16, rank (T) = rank (r<); hence the row rank and the column rank of A are 
equal. (This result was stated earlier as Theorem 5.9, page 90, and was proved in a direct way 
in Problem 5.21.) 



Supplementary Problems 

DUAL SPACES AND DUAL BASES 

11.19. Let 0 : R 3 -» R and a : R 3 -» R be the linear functionals defined by <j>(x, y,z) = 2x-Sy + z and 
o(x, y, z) = 4x - 2y + 3z. Find (i) <f> + °, (ii) 30, (iii) 2<p - 5<r. 

11.20. Let 0 be the linear functional on R* defined by 0(2,1) = 15 and 0(1,-2) = -10. Find <f>(x,y) and, 
in particular, find <f>(— 2, 7). 

11.21. Find the dual basis of each of the following bases of R 3 : 

(i) {(1,0,0), (0,1,0), (0,0,1)}, (ii) {(1,-2,3), (1,-1,1), (2,-4,7)}. 
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11.22. Let V be the vector space of polynomials over B of degree - 2. Let <p lt <f> 2 and 0 3 be the linear 
functional on V defined by 



Here f(t) = a+bt + c&eV and f'(t) denotes the derivative of f(t). Find the basis {f^t), / 2 (t), f 3 (t)} 
of V which is dual to {<f> lt <f> 2 , <Psi- 

11.23. Suppose u,vGV and that <j>{u) = 0 implies <f>(v) = 0 for all <j> £ V*. Show that v = ku for 
some scalar k. 

11.24. Suppose <4,<r£V* and that ${v) = 0 implies a(v) = 0 for all v £ V. Show that <j = fc<6 for 
some scalar fc. 

11.25. Let V be the vector space of polynomials over K. For a G K, define (4 a : V -» K by <&„(/( *)) = /(a). 
Show that: (i) 0 a is linear; (ii) if a ¥= b, then <j> a <t> b . 

11.26. Let V be the vector space of polynomials of degree — 2. Let a, b, c £ K be distinct scalars. Let 
<t> a , 0„ and <t> c be the linear functionals defined by 4> a (f(t)) = f(a), ^ b (/(*)) = /(&), <t> c (f{t)) = f(c). Show 
that {<j> a , 4> h , 4> c ) is linearly independent, and find the basis {/i(t)> / 2 (*), fs(t)} of V which is its dual. 

11.27. Let V be the vector space of square matrices of order n. Let T : V -> K be the trace mapping: 
T(A) = a n + a 22 + • • ■ + a nn , where A = (a i3 ). Show that T is linear. 

11.28. Let W be a subspace of F. For any linear functional <j> on If 7 , show that there is a linear functional 
<r on V such that a(w) = <p(w) for any w £ W, i.e. ^ is the restriction of a to W. 

11.29. Let {e x , . . .,e n } be the usual basis of K n . Show that the dual basis is {ir 1( . . .,ir„} where 7r { is the 
ith projection mapping: !((«„ . . .,a„) = a ( . 

11.30. Let V be a vector space over R. Let <j> v <j> 2 € V* and suppose a : V -» R defined by or(v) = $y(v) fai?) 
also belongs to V*. Show that either #j = 0 or <t> 2 = 0. 

ANNIHILATORS 

11.31. Let W be the subspace of R 4 spanned by (1,2,-3,4), (1,3,-2,6) and (1,4,-1,8). Find a basis of 
the annihilator of W. 

11.32. Let W be the subspace of R 3 spanned by (1, 1,0) and (0, 1, 1). Find a basis of the annihilator of W. 

11.33. Show that, for any subset S of V, L(S) = S 00 where L(S) is the linear span of S. 

11.34. Let U and W be subspaces of a vector space V of finite dimension. Prove: (U n W)° = U° + W°. 

11.35. Suppose V = U ® W. Prove that V* = U° © W°. 

TRANSPOSE OF A LINEAR MAPPING 

11.36. Let <j> be the linear functional on R 2 defined by <p(x, y) = 3x — 2y. For each linear mapping 
T find (T*(4))(x,y,z): 

(i) T(x,y, z) = (x + y.y + z); (ii) T(x, y, z) = (x + y + z, 2x - y). 

11.37. Suppose S.U-+V and T-.V^W are linear. Prove that (T ° S)* = S* <> TK 

11.38. Suppose T : V -» [7 is linear and V has finite dimension. Prove that Im T f = (Ker T) 0 . 

11.39. Suppose T : V -» U is linear and u&U. Prove that uelmf or there exists <j> £ V* such that 
rt(0) = 0 and <f,(u) = 1. 

11.40. Let V be of finite dimension. Show that the mapping T I-+ is an isomorphism from Horn (l 7 , V) 
onto Horn (V*, V*). (Here T is any linear operator on V.) 




o 
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MISCELLANEOUS PROBLEMS 

11.41. Let V be a vector space over R. The line segment uv joining points u, v £ V is denned by 
uv — {tu + (1 — t)v: 0 — t — 1}. A subset S of V is termed convex if u,v e S implies uvcS. 
Let 0£V* and let 

W + = {v e V : <p(v) > 0}, W = {v G V : <f,(v) = 0}, W~ = («eV: 0(f) < 0} 

Prove that W + , W and W~ are convex. 

11.42. Let V be a vector space of finite dimension. A hyperplane H of V is denned to be the kernel of a 
nonzero linear functional <t> on V. Show that every subspace of V is the intersection of a finite 
number of hyperplanes. 



Answers to Supplementary Problems 

11.19. (i) 6x-5y + 4z, (ii) 6* - 9y + 3«, (iii) -16* + 4y - 13z 

11.20. <p(x, y) = 4x + ly, <p(-2, 7) = 41 

11.21. (i) {<t>i(x,y,z) - x, <p 2 (x,y,z) - y, <t> z (x,y,z) = z} 

(ii) 3/. 2) = -3* - 52/ - 2z, 0 2 (*> > *) = 2 * + V> <fis(x, V, ») = * + 2jy + z) 

11.25. (ii) Let /(t) - t. Then 0 a (/(t)) = a ¥> b - 4> b (fW), and therefore <p a # 0 D . 

r _ t 2 - (b + c)t + be _ t*-(a+c)t + ac f { ) = t* - {a + b)t + ab\ 

11.26. |A(t) - (a _ 6)(o _ c) , /»(«)- (6 _ o)(6 ^ c) > >sW (c _ a)(e _ j) J 

11.31. {0i(a;, j/, «, t) = B» — j/ + z, <t> 2 (x, y, z, t) = 2y - t} 

11.32. {<fi(x, y,z) = x-y + z} 

11.36. (i) (T*(<t>)){x, y,z)=3x + y- 2z, (ii) (T^))(x, y, z) = -x + by + 3z. 
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BILINEAR FORMS 

Let V be a vector space of finite dimension over a field K. A bilinear form on V is a 
mapping / : V x V -* K which satisfies 

(i) f(aui + bu 2 , v) = af(ui, v) + bf(u 2 , v) 

(ii) f(u, avi + bv 2 ) - af(u, Vi) + bf{u, v 2 ) 

for all a, b e K and all ih, Vi G V. We express condition (i) by saying / is linear in the 
first variable, and condition (ii) by saying / is linear in the second variable. 

Example 12.1 : Let <j> and <r be arbitrary linear functional on V. Let f:VxV->K be denned by 
f(u, v) = <t>(u) <r(v). Then / is bilinear because 4> and a are each linear. (Such a 
bilinear form / turns out to be the "tensor product" of <t> and a and so is sometimes 
written / = 4> <g> <r.) 



Example 12.2: 



Let / be the dot product on R B ; that is, 

f(u, v) = wv = + a 2 b 2 + 



Example 12.3: 



where u = (a { ) and v = (6$). Then / is a bilinear form on E". 

Let A = (o y ) be any n X n matrix over K. Then A may be viewed as a bilinear 
form / on K n by defining 

Ca n "lz • • • a in \ Ivi 
a2 .\. az2 ..:::.. a2n . \ 
a nl a n2 ... a nn J \ y n j 

n 

= 2 0,^, = a^Xjyy + a^x^ + • • • + a nn x n y n 
U = l 

The above formal expression in variables x it is termed the bilinear polynomial 
corresponding: to the matrix A. Formula (J) below shows that, in a certain sense, 
every bilinear form is of this type. 

We will let B(V) denote the set of bilinear forms on V. A vector space structure is 
placed on B(V) by defining f + g and kf by: 

(f + 9)(u,v) = f(u,v) + g{u,v) 
(kf){u,v) = kf{u,v) 
for any /, g G B(V) and any k G K. In fact, 

Theorem 12.1: Let V be a vector space of dimension n over K. Let {<£ x , ...,</>„} be a 
basis of the dual space V*. Then {/« : i,j = 1, . . .,n} is a basis of B(V) 
where /« is defined by fn(u,v) - £.(%) ^(v). Thus, in particular, 
dimB(V) = n 2 . 
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BILINEAR FORMS AND MATRICES 

Let / be a bilinear form on V, and let {e u . . .,e n ) be a basis of V. Suppose u,v G V 
and suppose 

u = aiei + • • ■ + (Ufin, v = bid + • • • + b n e n 

Then 

f(u, v) — /(aiei + • • • + a„e„, b x e\ + ■ ■ • + b n e n ) 

n 

= aibi/(ei,ei) + aib 2 /(ei,e 2 ) + ••• + a n b n f{en, e n ) = 2 aibjf{ei,e } ) 

i,i = 1 

Thus / is completely determined by the n 2 values f(ei, e 3 ). 

The matrix A = (an) where a y = f(e it e } ) is called the matrix representation of f rel- 
ative to the basis {ei} or, simply, the matrix of f in {e t }. It "represents" / in the sense that 

f(u, v) = 2 aibjf(ei, e t ) = (a lr ...,a n )A 2 = M*A[v] e (1) 

w 

for all u, v G V. (As usual, [u] e denotes the coordinate (column) vector of u G V in the 
basis {ei}.) 

We next ask, how does a matrix representing a bilinear form transform when a new 
basis is selected ? The answer is given in the following theorem. (Recall Theorem 7.4 that 
the transition matrix P from one basis {ei) to another {e'i} has the property that [u] e = P[u] e ' 
for every u G V.) 

Theorem 12.2: Let P be the transition matrix from one basis to another. If A is the 
matrix of / in the original basis, then 

B = P'AP 

is the matrix of / in the new basis. 

The above theorem motivates the following definition. 

Definition: A matrix B is said to be congruent to a matrix A if there exists an invertible 
(or: nonsingular) matrix P such that B = P* A P. 

Thus by the above theorem matrices representing the same bilinear form are congruent. 
We remark that congruent matrices have the same rank because P and P f are nonsingular; 
hence the following definition is well defined. 

Definition: The rank of a bilinear form / on V, written rank(/), is defined to be the rank 
of any matrix representation. We say that / is degenerate or nondegenerate 
according as to whether rank (/) < dim V or rank (/) = dim V. 



ALTERNATING BILINEAR FORMS 

A bilinear form / on V is said to be alternating if 

(i) f(v,v) = 0 

for every v G V. If / is alternating, then 

0 = f(u + V, u + v) = f(u, u) + f(u, v) + f(v, u) + f{v, v) 

and so (ii) f{u, v ) = -f(v,u) 
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for every u,v GV. A bilinear form which satisfies condition (ii) is said to be skew sym- 
metric (or: anti-symmetric). If 1 + 1^0 in K, then condition (ii) implies f(v, v) = — f(v, v) 
which implies condition (i). In other words, alternating and skew symmetric are equivalent 
when 1 + 1^0. 

The main structure theorem of alternating bilinear forms follows. 

Theorem 12.3: Let / be an alternating bilinear form on V. Then there exists a basis of 
V in which / is represented by a matrix of the form 



0 1 I 



I Oil 
1 -1 0 1 



\ 



o 1 



|-i 0 i 
1 



Moreover, the number of 
it is equal to £ rank (/)). 



0 

-1 



is uniquely determined by / (because 



In particular, the above theorem shows that an alternating bilinear form must have 
even rank. 



SYMMETRIC BILINEAR FORMS, QUADRATIC FORMS 

A bilinear form / on V is said to be symmetric if 

f{u,v) = f(v,u) 

for every u, v G V. If A is a matrix representation of /, we can write 

f(X, Y) = X'AY = (X*AYy = Y'A'X 

(We use the fact that X*AY is a scalar and therefore equals its transpose.) Thus if / is 
symmetric, 

Y'A'Z = f{X,Y) = f{Y,X) = Y f AX 

and since this is true for all vectors X, Y it follows that A = A* or A is symmetric. Con- 
versely if A is symmetric, then / is symmetric. 

The main result for symmetric bilinear forms is given in 

Theorem 12.4: Let / be a symmetric bilinear form on V over K (in which 1 + 1^0). 

Then V has a basis {vi, . . .,v„} in which / is represented by a diagonal 
matrix, i.e. f(vi, v,) — 0 for i j. 

Alternate Form of Theorem 12.4: Let A be a symmetric matrix over K (in which 1 + 1^0). 

Then there exists an invertible (or: nonsingular) matrix P such that P'AP 
is diagonal. That is, A is congruent to a diagonal matrix. 
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Since an invertible matrix P is a product of elementary matrices (Problem 3.36), one way 
of obtaining the diagonal form P*AP is by a sequence of elementary row operations and 
the same sequence of elementary column operations. These same elementary row opera- 
tions on I will yield P f . This method is illustrated in the next example. 



Example 12.4: 




1 2-3 

2 5 — 4 j, a symmetric matrix. It is convenient to form the block 
-3-4 8, 





1 1 


2 


-3 


1 


0 




(A, I) = 


2 


5 


-4 


0 


1 


i) 




\-3 


-4 


8 


0 


0 





We apply the operations R 2 -» —2R 1 + R 2 and i? 3 3.^ + -R 3 to (A, I), and then 
the corresponding operations C 2 -* — 2C t + C 2 and C 3 -» 3C t + C 3 to A to obtain 




1 

-2 
3 





/I 


0 


0 


1 


0 




and then 


0 


1 


2 I 


-2 


1 


3 




\o 


2 


-1 


3 


0 





We next apply the operation R 3 -* —2R 2 + R3 and then the corresponding operation 
C 3 -* — 2C 2 + C 3 to obtain 





and then 



0 [ -2 



0 —5 I 7-2 



Now A has been diagonalized. We set 



P = 




and then P'AP = 




Definition: A mapping q:V-*K is called a quadratic form if g(v) = /(v,v) for some 
symmetric bilinear form / on V. 

We call q the quadratic form associated with the symmetric bilinear form /. If 
1 + 1^0 in if, then / is obtainable from q according to the identity 

f(u,v) = U<l(u + v)-q{u)-q{v)) 
The above formula is called the polar form of /. 

Now if / is represented by a symmetric matrix A = (a„), then q is represented in the 
form 



q(X) = f(X,X) = X'AX = (x u ...,».) 



an ai2 




ai„\ 


/Xi 


Cl21 CI22 




a2n \ 


t 


Clnl Ctn2 




Ctnn / 








a^jXitCj 





u 

The above formal expression in variables x t is termed the quadratic polynomial correspond- 
ing to the symmetric matrix A. Observe that if the matrix A is diagonal, then q has the 
diagonal representation 

q(X) = X*AX = aux\ + a 22 x\ + • • • + a nn x 2 n 
that is, the quadratic polynomial representing q will contain no "cross product" terms. By 
Theorem 12.4, every quadratic form has such a representation (when 1 + 1^0). 
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Example 12.5: Consider the following quadratic form on R 2 : 

q(x, y) = 2x 2 - 12xy + 5j/ 2 

One way of diagonalizing q is by the method known as "completing the square" 
which is fully described in Problem 12.35. In this case, we make the substitution 
x = s + St, y = t to obtain the diagonal form 

q(x,y) - 2(s + 3t) 2 - 12(s + 3t)t + 5t 2 = 2s 2 - 13t 2 



REAL SYMMETRIC BILINEAR FORMS. LAW OF INERTIA 

In this section we treat symmetric bilinear forms and quadratic forms on vector 
spaces over the real field R. These forms appear in many branches of mathematics and 
physics. The special nature of R permits an independent theory. The main result follows. 



Theorem 12.5: Let / be a symmetric bilinear form on V over R. Then there is a basis of 
V in which / is represented by a diagonal matrix; every other diagonal 
representation has the same number P of positive entries and the same 
number N of negative entries. The difference S = P — N is called the 

signature of /. 

A real symmetric bilinear form / is said to be nonnegative semidefinite if 

q(v) --- f(v,v) ^ 0 
for every vector v; and is said to be positive definite if 

q(v) = f(v,v) > 0 

for every vector v ¥= 0. By the above theorem, 

(i) / is nonnegative semidefinite if and only if S — rank (/) 

(ii) / is positive definite if and only if S = dim V 

where S is the signature of /. 



Example 12.6: Let / be the dot product on R"; that is, 

f(u,v) = wv = a 1 b 1 + a z b 2 + • • 1 + a n b n 
where u = (a f ) and v — (6 f ). Note that / is symmetric since 

f(u, v) = wv = vu = f(v, u) 
Furthermore, / is positive definite because 

f(u,u) = a\+al + ■■■ +a% > 0 

when u # 0. 

In the next chapter we will see how a real quadratic form q transforms when the transi- 
tion matrix P is "orthogonal". If no condition is placed on P, then q can be represented 
in diagonal form with only l's and — l's as nonzero coefficients. Specifically, 

Corollary 12.6: Any real quadratic form q has a unique representation in the form 

q(xi, . . .,x n ) = »?+•••+*!- x 2 s+1 - •• • - x\ 

The above result for real quadratic forms is sometimes referred to as the Law of Inertia 
or Sylvester's Theorem. 
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HERMITIAN FORMS 

Let V be a vector space of finite dimension over the complex field C. Let f : V x V -» C 
be such that 

(i) f(aui + bu 2 , v) = a f(ui, v) + b f(u 2 , v) 



(ii) f(u,v) = /(*;,«) 

where a, b G C and !)£ V, Then / is called a Hermitian form on 7. (As usual, fc 
denotes the complex conjugate of k G C.) By (i) and (ii), 



f(u, avi + bv 2 ) = f(avi + bv 2 , u) — a f(vi, u) + b f(v 2 , u) 

= df(v u u) + bf(v 2 ,u) = af(u,vi) + bf(u,v 2 ) 

That is, (iii) f(u,avi + bv 2 ) = a f(u, Vi) + 5 /(«, v 2 ) 

As before, we express condition (i) by saying / is linear in the first variable. On the other 
hand, we express cond ition ( iii) by saying / is conjugate linear in the second variable. Note 
that, by (ii), f(v, v) = f(v, v) and so f(v, v) is real for every v G V. 

Example 12.7: Let A = (a i3 ) be an n X n matrix over C. We write A for the matrix obtained by 
taking the complex conjugate of every entry of A, that is, A = (a^). We also write 
A* for A« = AJ. The matrix A is said to be Hermitian if A* = A, i.e. if a y = af x . 
If A is Hermitian, then f{X, Y) =X'AY defines a Hermitian form on C" (Prob- 
lem 12.16). 

The mapping q : V -* R defined by q(v) = f(v, v) is called the Hermitian quadratic form 
or complex quadratic form associated with the Hermitian form /. We can obtain / from 
q according to the following identity called the polar form of /: 

f(u, v) = i(q(u + v)- q(u - v)) + j(q(u + iv) - q(u - iv)) 

Now suppose {ei, . . . , e„} is a basis of V. The matrix H = {h i} ) wher e h i} - f{e it e,) is 
called the matrix representation of / in the basis {e ( }. By (ii), /(e,,e 3 ) = f{e )r e^; hence H 
is Hermitian and, in particular, the diagonal entries of H are real. Thus any diagonal rep- 
resentation of / contains only real entries. The next theorem is the complex analog of 
Theorem 12.5 on real symmetric bilinear forms. 

Theorem 12.7: Let / be a Hermitian form on V. Then there exists a basis {e u ...,e n } of 
V in which / is represented by a diagonal matrix, i.e. /(e { , ei) = 0 for 
i ¥= j. Moreover, every diagonal representation of / has the same number 
P of positive entries, and the same number N of negative entries. The 
difference S = P-N is called the signature of /. 

Analogously, a Hermitian form / is said to be nonnegative semidefinite if 

q(v) = f(v,v) ^ 0 
for every v &V, and is said to be positive definite if 

q(v) - f(v,v) > 0 

for every v ¥= 0. 

Example 12.8: Let / be the dot product on C"; that is, 

f(u, v) — u • V — Z t Wi + z 2 w 2 + • • • + z n w n 
where w = (z 4 ) and v - (wj. Then / is a Hermitian form on C". Moreover, / is 
positive definite since, for any v # 0, 

f(u, U) = + Z 2 * 2 + * * • + *A = l*l! 2 + l*2| 2 + • • • + l*n| 2 > 0 
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Solved Problems 

BILINEAR FORMS 

12.1. Let u = (xi, x 2 , x 3 ) and v = (y u y 2 , y a ), and let 

f(u, v) = 3xiyi - 2xiy z + 5xnyi + lx 2 y 2 - 8x 2 y a + 4x 3 y 2 - x s ya 
Express / in matrix notation. 

Let A be the 3X3 matrix whose y-entry is the coefficient of x^y,. Then 





!z 


-2 






f(u, v) = X*AY = (»!, x 2 , x 3 ) 




7 




0 




\o 


4 







12.2. Let A be an n x n matrix over K. Show that the following mapping / is a bilinear 
form on K» : f(X, Y) = X* A Y . 

For any a, b G K and any X it Y { e K", 

f(aX 1 + bX 2 , Y) = (aX^bX^AY = (aX[ + bX* 2 ) AY 

= aX\AY + bX* 2 AY = af{X u Y) + b f(X 2 , Y) 

Hence / is linear in the first variable. Also, 

f(X, aY x + bY 2 ) = XtA(aY 1 + bY 2 ) = aX*AY l + bX*AY 2 = a f(X, YJ + b f(X, Y 2 ) 

Hence / is linear in the second variable, and so / is a bilinear form on K". 



12.3. Let / be the bilinear form on R 2 denned by 

f((xi, x 2 ), (2/1, 2/2)) = 2x x yi - Sxiy 2 + x 2 y 2 

(i) Find the matrix A of / in the basis {ui = (1,0), Ui = (1, 1)}. 

(ii) Find the matrix B of / in the basis {vi - (2, 1), v 2 = (1, -1)}. 

(iii) Find the transition matrix P from the basis {im} to the basis {Vi), and verify 
that B = P*AP. 

(i) Set A = (o y ) where a {j = /(w*, Uj): 

on = /(%,%) = /((1,0), (1,0)) = 2-0 + 0= 2 

o 12 = f(u v u 2 ) = /((1,0),(1,1)) = 2-3 + 0 = -1 

^ = f(u 2 ,u x ) = (1,0)) = 2-0 + 0= 2 

= f(u 2 ,u 2 ) = /((l, 1), (1, 1)) = 2-3 + 1 = 0 

ft —IN 

Thus A = I g 0 / S the matrix of f in tne basis { u i> u 2i- 

(ii) Set B = (b i} ) where b tj = f(v„ v } ): 

hi = = /((2,1),(2,1)) = 8-6 + 1 = 3 

612 = f(vvv 2 ) = /((2,1), (1,-1)) = 4 + 6-1 = 9 

621 = - /((1,-D,(2,1)) = 4-3-1 = 0 

622 = Kv 2 ,v 2 ) = /((1,-1),(1,-1)) = 2 + 3 + 1 = 6 

/3 9\ 

Thus B = I j is the matrix of / in the basis {v lt v 2 }. 

(iii) We must write v 1 and v 2 in terms of the m s : 

v, = (2,1) = (1,0) + (1,1) = u t + u 2 
v 2 = (1,-1) = 2(1,0) -(1,1) = 2 Mi -m 2 
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Then P = ( J and so P* = Q _J j . Thus 

p<Ap = c ;)c c n = IT) = * 



(2 -i)( 2 o)(i -1 



0 6 



12.4. Prove Theorem 12.1: Let V be a vector space of dimension n over K. Let . . . , <£ n } 
be a basis of the dual space V*. Then {/«: i,/ = 1,. . .,n} is a basis of B(V) where 
/« is defined by f. } {u, v) = ^(v). Thus, in particular, dim B(V) = n 2 . 

Let {e v . . ., e n } be the basis of V dual to We first show that {f tj } spans B{V). Let / G B(V) 

and suppose /(e i( e 3 ) = o y . We claim that / = 2 <*«/«. It suffices to show that /(e s ,e t ) = 
(2 aij/ij)(e s , c t ) for s,t = 1,. . ,,n. We have 

(2 «ij/ij)(e s , e t ) = 2aij/«(e s ,e t ) = 2 fc(« s ) ^(e t ) 

= 2aij«i S S jt = a sl = /(e s ,e ( ) 

as required. Hence spans B(V). 

It remains to show that is linearly independent. Suppose 2ay/ij = 0. Then for 

s,t = 1,. . ,,n, 

0 = 0(e s ,e ( ) = (2ay/ij)(e s ,e t ) = a rs 
The last step follows as above. Thus {/ y } is independent and hence is a basis of B(V). 



12.5. Let [/] denote the matrix representation of a bilinear form f on V relative to a basis 
{ei, . . . , e„} of V. Show that the mapping / h> [/] is an isomorphism of B(V) onto 
the vector space of «-square matrices. 

Since / is completely determined by the scalars f{e it e ; ), the mapping /!-»[/] is one-to-one and 
onto. It suffices to show that the mapping / h» [/] is a homomorphism; that is, that 

[af+bg] = a[f] + b[g] (*) 

However, for i,j = 1,. . .,n, 

(af + bg)(e it e } ) = af(e u + bg(e it e } ) 
which is a restatement of (*). Thus the result is proved. 



12.6. Prove Theorem 12.2: Let P be the transition matrix from one basis {ei} to another 
basis {el}. If A is the matrix of / in the original basis {ei}, then B — P t AP is the 
matrix of / in the new basis {e'J. 

Let m, v G V. Since P is the transition matrix from {ej to {e,'}, we have P[u] e . — [u] e and 
p We' = We> hence [u^ = [u]l, PK Thus 

f(u,v) = [m]« A [v] e = [u]l.P*AP[v] e . 

Since u and v are arbitrary elements of V, P t A P is the matrix of / in the basis {e[}. 



SYMMETRIC BILINEAR FORMS. QUADRATIC FORMS 

12.7. Find the symmetric matrix which corresponds to each of the following quadratic 
polynomials: 

(i) q(x,y) - 4x 2 -6xy-7y 2 (iii) q{x,y,z) = 3x 2 + 4xy - y 2 + 8xz - 6yz + z 2 

(ii) q(x,y) = xy + y 2 (iv) q(x,y,z) = x 2 -2yz + xz 

The symmetric matrix A = (a y ) representing q(x lt . . . , x n ) has the diagonal entry a u equal to 
the coefficient of xf and has the entries a y and each equal to half the coefficient of x t Xj. Thus 
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(4 :?) C t) 



(i) 



(ii) 





12.8. For each of the following real symmetric matrices A, find a nonsingular matrix P 
such that P* A P is diagonal and also find its signature: 




(ii) A 






(i) 



(i) First form the block matrix (A, I): 

(A, I) = 



Apply the row operations R 2 ^ SR l + R 2 and R 3 -» — 2.R, + R 3 to (A,/) and then the corre- 
sponding column operations C 2 -» 3C t + C 2 and C 3 -» — 2C\ + C 3 to A to obtain 



and then 



Next apply the row operation R 3 -* R 2 + 2R 3 and then the corresponding column operation 
C 3 -* C 2 + 2C 3 to obtain 






0 


0 


1 


0 


0 


2 


1 


3 


1 


0 


0 


9 


-1 


1 


2 



and then 




Now A has been diagonalized. Set P = 
The signature S of A is S = 2 - 1 




then P*AP = 




1. 



(ii) First form the block matrix (A, /): 

(A,/) = 



In order to bring the nonzero diagonal entry —1 into the first diagonal position, apply the row 
operation R x <-> R 3 and then the corresponding column operation C t «-» C 3 to obtain 



1 


1 


1 


0 


0 


-2 


2 


0 


1 


0 


2 


-1 


0 


0 


1 



2 


-1 


0 


0 


-2 


2 


0 


1 


1 


1 


1 


0 



and then 



2 1 
-2 1 
1 0 




Apply the row operations R 2 -* 2i?j + J? 2 an d JB3 i? t + i? 3 and then the corresponding 
column operations C 2 -^2C 1 + C 2 and C 3 -* C t + C 3 to obtain 



/-l 
0 

\ 0 



2 
2 
3 




and then 
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Apply the row operation R 3 ■ 
C 3 -> -3C 2 + 2C 3 to obtain 



-3JZ 2 + 2i? 3 and then the corresponding column operation 





and then 



Now A has been diagonalized. Set P = 

The signature S of A is the difference S = 1 — 2 — — 1 




-1 


0 


0 


0 


0 


0 


2 


0 


0 


1 


0 


0 


-14 


2 


-3 



then P*AP 



1-1 
0 



0 -14, 



12.9. 1 Suppose 1 + 1*0 in K. Give a formal algorithm to diagonalize (under congruence) 
a symmetric matrix A = (an) over K. 

Case I: a n ¥= 0. Apply the row operations R t -^ —a ll R 1 + a n R it i = 2,...,n, and then the 

corresponding column operations C t -* —a n C l + a n Ci to reduce A to the form ^ a * 1 . 

Case II: a u = 0 but a a ¥* 0, for some i > 1. Apply the row operation R 1 *->R i and then the 
corresponding column operation C 1 <-» C { to bring a u into the first diagonal position. This reduces 
the matrix to Case I. 

Case III: All diagonal entries a u = 0. Choose i, j such that a tj # 0, and apply the row opera- 
tion R t -» Jfy + iZj and the corresponding column operation C i -* Cj + C; to bring 2a i} # 0 into the 
ith diagonal position. This reduces the matrix to Case II. 

/<»ii 0\ 

In each of the cases, we can finally reduce A to the form ( Q B j where £ is a symmetric 
matrix of order less than A. By induction we can finally bring A into diagonal form. 

Remark: The hypothesis that 1 + 1 # 0 in K, is used in Case III where we state that 2a iS # 0. 



12.10. Let q be the quadratic form associated with the symmetric bilinear form /. Verify 
the following polar form of /: f(u,v) = Uq{u + v) - q(u) - q(v)). (Assume that 
1 + 1*0.) 

q(u + v) — q(u) — q(v) = f(u + v, u + v) — f(u, u) — f(v, v) 

= f(u, u) + f(u, v) + f(v, u) + f(v, v) - f{u, u) - f(v, v) 
= 2/(w,«) 

If 1 + 1^0, we can divide by 2 to obtain the required identity. 



12.11. Prove Theorem 12.4: Let / be a symmetric bilinear form on V over K (in which 
1 + 1*0). Then V has a basis {vi, ...,v n ) in which / is represented by a diagonal 
matrix, i.e. f(Vi, v,) = 0 for i j. 

Method 1. 

If / = 0 or if dim V — 1, then the theorem clearly holds. Hence we can suppose / # 0 and 
dim V = n > 1. If q(v) = f(v, v) = 0 for every v G V, then the polar form of / (see Problem 12.10) 
implies that / = 0. Hence we can assume there is a vector DjEV such that f(v u ¥= 0. Let 
U be the subspace spanned by v t and let W consist of those vectors v G V for which f(v lt v) = 0. 
We claim that V = U 0 W. 

(i) Proof that U n W = {0}: Suppose u e U n W. Since u G 17, u~kv l for some scalar k G K. 
Since u G W, 0 = /(it, u) = f(kv u kvj = k 2 f(v u v^. But # 0; hence fc = 0 and there- 

fore it = fcf ! = 0. Thus UnW = {0}. 
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(ii) Proof that V = U + W: Let v S V. Set 

^i) 

Then f(v u w) = - ^^/(« x .« 1 ) = 0 

Thus to S If . By (i), i; is the sum of an element of U and an element of W. Thus V = f/ + W. 
By (i) and (ii), V =U®W. 

Now / restricted to W is a symmetric bilinear form on W. But dim W — n — 1; hence by 
induction there is a basis {i> 2 , ...,n„} of W such that f(v t , v>) = 0 for i ¥° j and 2 — i, ? — n. But 
by the very definition of W, f{v lt v } ) =0 for j = 2, . . . , n. Therefore the basis {v u . . . , v n } of V 
has the required property that f(v it Vj) = 0 for i # j. 

Method 2. 

The algorithm in Problem 12.9 shows that every symmetric matrix over K is congruent to a 
diagonal matrix. This is equivalent to the statement that / has a diagonal matrix representation. 



12.12. Let A = 



lai 



0,2 



, a diagonal matrix over K. Show that: 



(i) for any nonzero scalars ki, . . . , fe„ G K, A is congruent to a diagonal matrix 
with diagonal entries atkf; 

(ii) if K is the complex field C, then A is congruent to a diagonal matrix with only 
l's and 0's as diagonal entries; 

(iii) if K is the real field R, then A is congruent to a diagonal matrix with only 
l's, —l's and 0's as diagonal entries. 



(i) Let P be the diagonal matrix with diagonal entries fc { . Then 



P*AP = 



/ h 



\ 




a 2 



fc 2 



i^i 



a 2 fe| 



. Then PtAP has 



(ii) Let P be the diagonal matrix with diagonal entries b t 
the required form. 

(iii) Let P be the diagonal matrix with diagonal entries b t =•{'[' " " . • Then P*AP has 
the required form. 

Remark. We emphasize that (ii) is no longer true if congruence is replaced by Hermitian con- 
gruence (see Problems 12.40 and 12.41). 



fl/Voi if <*i ¥> 0 

\ 1 if o 4 = 0 

f 1A/RI if «j ^ 0 

~ \ 1 if % = 0 



12.13. Prove Theorem 12.5: Let / be a symmetric bilinear form on V over R. Then there 
is a basis of V in which / is represented by a diagonal matrix, and every other diagonal 
representation of / has the same number of positive entries and the same number of 
negative entries. 

By Theorem 12.4, there is a basis {w,, . . . , i* n } of V in which / is represented by a diagonal 
matrix, say, with P positive and N negative entries. Now suppose {w u . .., w n } is another basis of 
V in which / is represented by a diagonal matrix, say, with P' positive and N' negative entries. We 
can assume without loss in generality that the positive entries in each matrix appear first. Since 
rank (/) = P + N = P' + N', it suffices to prove that P = P'. 
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Let U be the linear span of u u . . .,u P and let W be the linear span of w P , + 1 , ...,w n . Then 
f(v,v) > 0 for every nonzero v G U, and f(v,v) ^ 0 for every nonzero v G W. Hence 
U n W = {0}. Note that dim [7 = P and dim W = n - P'. Thus 

dim(C7+TF) = dim U + dim W - dim (E7nJF) = P + («-P')-0 = P - P' + n 

But dim (E7 + TP) — dim V = n; hence P-P' + n^w or P ^ P'. Similarly, P' ^ P and there- 
fore P = P', as required. 

Remark. The above theorem and proof depend only on the concept of positivity. Thus the 
theorem is true for any subfield K of the real field R. 



12.14. An n x n real symmetric matrix A is said to be positive definite if X*AX > 0 for 
every nonzero (column) vector X £ R", i.e. if A is positive definite viewed as a 
bilinear form. Let B be any real nonsingular matrix. Show that (i) B*B is sym- 
metric and (ii) B*B is positive definite. 

(i) (B*BY = B'B U = P'P; hence B*B is symmetric. 

(ii) Since B is nonsingular, BX ¥> 0 for any nonzero X G R n . Hence the dot product of BX with 
itself, BX-BX = (BX)t(BX), is positive. Thus XHB*B)X = (X*Bt)(BX) = (BX)t(BX) > 0 as 
required. 



HERMITIAN FORMS 

12.15. Determine which of the following matrices are Hermitian: 

2 + Si 
5 

6-2i 
(i) 

A matrix A = (a y ) is Hermitian iff A = A*, i.e. iff a y = ofi. 

(i) The matrix is Hermitian, since it is equal to its conjugate transpose. 

(ii) The matrix is not Hermitian, even though it is symmetric. 

(iii) The matrix is Hermitian. In fact, a real matrix is Hermitian if and only if it is symmetric. 





4 -3 


5 


-3 2 




5 1 


-6 


(iii) 





12.16. Let A be a Hermitian matrix. Show that / is a Hermitian form on C n where / is 
defined by f(X, Y) = X { A Y. 

For all a, b G C and all X„ X 2 , Y G C», 

f(aX 1 + bX 2 , Y) = (aXi + bX^AY = (aX\+bX\)AY 

= aX\AY + bXlAY = af(X u Y) + bf(X 2 ,Y) 

Hence / is linear in the first variable. Also, 

TCXTF) = FIT = (X*AYY = Y*A*X = Y*A*X = Y*AX = f(Y,X) 
Hence / is a Hermitian form on C". (Remark. We use the fact that X* A Y is a scalar and so it is 
equal to its transpose.) 



12.17. Let / be a Hermitian form on V. Let H be the matrix of / in a basis {e h ...,e n ) of 
V. Show that: 

(i) f(u,v) = [u]lH\v\ e for all u,v S V; 

(ii) if P is the transition matrix from {e 4 } to a new basis {e,'} of V, then B = P t HP 
(or: B — Q*HQ where Q = P) is the matrix of / in the new basis {e,'}. 

Note that (ii) is the complex analog of Theorem 12.2. 

(i) Let u,v GV and suppose 

u = + a 2 e 2 + • • ' + «A and v = fc^ + b 2 e 2 + • • • + b n e n 
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Then f(u, v) = /(a^j + • ■ • + a n e n , h x e x + • • ■ + b n e n ) 

= 2 a-fij f(e t , e,) = (a lt ...,a n )H 6 

i, i 

as required. 

(ii) Since P is the transition matrix from {e { } to {e/}, then 



P[u] e . = [«]„ P[v] e . = [v] 



and so [u\l= [u]l,PK [v] e = P [v] 



Thus by (i), f(u,v) = [a]|ff[t] ( = [u]l. P t H P [v] e .. But m and v are arbitrary elements of V; 
hence P t HP is the matrix of / in the basis {e,'}. 



1 +i 2i 

12.18. Let H = | 1 - i 4 2 — Si j , a Hermitian matrix. Find a nonsingular ma- 

-2i 2 + Si 7 
trix P such that P l HP is diagonal. 
First form the block matrix (H, I): 

1 1 + i 2i 
1-i 4 2-3i 
-2i 2 + 3i 7 

Apply the row operations B 2 -* (—1 + + fl 2 and fi 3 -» 2^ + i? 3 to (A, J) and then the 
corresponding "Hermitian column operations" (see Problem 12.42) C 2 -* (—1 — i)C x + C 2 and 
C 3 -» — 2iC! + C 3 to A to obtain 





and then 




0 

-hi 
3 



1 

-1 + 

2,i 



0 0" 

1 0 
0 1, 



Next apply the row operation R 3 -» — 5iR 2 + 2i? 3 and the corresponding Hermitian column opera- 
tion C 3 -» 5iC 2 + 2C 3 to obtain 





and then 




1 

-1 + i 



Now H has been diagonalized. Set 

/ 1 -1 + t 5 + 9i 1 
P = 0 1 -5i 
\0 0 2 



and then P'HP 




5 + 9i -5i 



0 -38/ 



Observe that the signature S of H is 5 = 2 — 1 = 1. 



MISCELLANEOUS PROBLEMS 

12.19. Show that any bilinear form / on V is the sum of a symmetric bilinear form and a 
skew symmetric bilinear form. 

Set g(u, v) = i}[f(u, v) + f(v, it)] and h(u, v) = ^[f(u,v) — f(v, u)]. Then g is symmetric because 
g(u,v) = £[/(m,v) + /(v,m)] = i[f(v,u) + f{u,v)] = g(v,u) 
and h is skew symmetric because 

h(u,v) = i[f(u,v) - f(v,u)] = -%[f(v,u) -f(u,v)] = -h(v,u) 

Furthermore, f — g + h. 
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12.20. Prove Theorem 12.3: Let / be an alternating bilinear form on V. Then there exists 
a basis of V in which / is represented by a matrix of the form 

o i ; 

-1 o i 



T~o~Y! 



o 
-l 



0 I 



! 0 1 
i — i . 



! o 



o 1 

-1 o 



is uniquely determined by / (because it is equal 



Moreover, the number of 
to i[rank (/)]). 

If / = 0, then the theorem is obviously true. Also, if dim V = 1, then f(kiu,, k 2 u) = 
fc 1 fc 2 f(u, u) = 0 and so / = 0. Accordingly we can assume that dim V > 1 and / ¥* 0. 

Since / # 0, there exist (nonzero) u u w 2 G V such that f(u v u 2 ) ¥> 0. In fact, multiplying 
u x by an appropriate factor, we can assume that f(u ly u 2 ) = 1 and so f(u 2 ,Ux) = — 1. Now % and 
u 2 are linearly independent; because if, say, u 2 = ku u then /(%, u 2 ) = f(u lr kuj) = k f(u v u{) = 0. 
Let U be the subspace spanned by u x and u 2 , i.e. U - L(m 1 ,m 2 ). Note: 

(i) the matrix representation of the restriction of / to U in the basis {u v w 2 } is I _^ ^ 1 ; 

(ii) if u£ U, say w = attj + bu 2 , then 

/(w.Wj) = /(awj + bu 2 , Mi) = -6 

f(u,u 2 ) = f(au 1 + bu 2 ,u 2 ) = a. 

Let W consist of those vectors w G V such that f(w, u t ) = 0 and f(w, u 2 ) = 0. Equivalently, 

W — {wGV: f(w, u) — 0 for every it 6 U] 

We claim that V = U ® W. It is clear that 17 n W = {0}, and so it remains to show that 
V = U+W. Let v £ V. Set 

M = /(v, m 2 K - /(«, «i)M2 and w = f-« U) 

Since u is a linear combination of u t and w 2 , w G 17. We show that W eW. By (i) and (ii), 
/(m,«j:) = /(v.Mj); hence 

/(W, = f(v - U, Mj) = f(v, Uj) - f(u, Wj) = 0 

Similarly, f(u, u 2 ) = f(v, u 2 ) and so 

f(w,u 2 ) = f(v-u,u 2 ) = ftv.uj ~ f(u,u 2 ) = 0 

Then w€W and so, by (1), v = w + w where w G [7 and w G W. This shows that V = t/ + W; 
and therefore V = U 0 IF. 

Now the restriction of / to W is an alternating bilinear form on W. By induction, there exists 
a basis u 3 , . . .,u„ of W in which the matrix representing / restricted to W has the desired form. 
Thus u v u 2 ,u 3 ,. . .,«„ is a basis of V in which the matrix representing / has the desired form. 
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Supplementary Problems 

BILINEAR FORMS 

12.21. Let m = (x u x 2 ) and v = (y lt y 2 ). Determine which of the following are bilinear forms on R 2 : 

(i) f(u,v) = 2x 1 y 2 -3x 2 y 1 (iv) f(u,v) = * 1 * 2 + 

(ii) f(u,v) = x t + y 2 (v) f(u,v) = 1 

(iii) f(u,v) = 3x^2 (vi) f(u,v) = 0. 



12.22. Let / be the bilinear form on R 2 defined by 

/((*!, x 2 ), (y u y 2 )) = 3x x y x - 2x x y 2 + ±x 2 y x - x 2 y 2 

(i) Find the matrix A of / in the basis {mj = (1, 1), u 2 = (1, 2)}. 

(ii) Find the matrix B of / in the basis {v l = (1, —1), v 2 - (3, 1)}. 

(iii) Find the transition matrix P from {mJ to {v { } and verify that B -P*AP. 



12.23. Let V be the vector space of 2 X 2 matrices over R. Let M = [\ 2 ), and let f(A,B) = 



.3 5, 

tr(A'MJS) where A,BGV and "tr" denotes trace, (i) Show that / is a bilinear form on V. 



(ii) Find the matrix of / in the basis 



{(; ik in; sms : 



12.24. Let B(V) be the set of bilinear forms on V over K. Prove: 

(i) if f,ge.B(V), then f + g and kf, for k&K, also belong to B(V), and so B(V) is a subspace 
of the vector space of functions from V X V into K; 

(ii) if $ and a are linear f unctionals on V, then f(u, v) = £(«.) a(v) belongs to B(V). 

12.25. Let / be a bilinear form on V. For any subset S of V, we write 

S 1 = {vGV : f(u,v) = 0 for every uGS}, S T = {vGV: f(v,u) = 0 for every tt G S} 

Show that: (i) S 1 and S T are subspaces of V; (ii) S!CS 2 implies S 2 i cS 1 i and C s[ ; 

(iii) {0} 1 ={0} T =y. 

12.26. Prove: If / is a bilinear form on V, then rank (/) = dim V - AimV^ = dim V — dimV T and 
hence dim V 1 - = dim V T . 

12.27. Let / be a bilinear form on V. For each uGV, let u :V -* K and u : V -* K be defined by 

u(x) = f(x, u) and u(x) = f(u, x). Prove: 

(i) u and u are each linear, i.e. u,u G V*; 

(ii) u h+ u and m h» tt are each linear mappings from V into V*; 

(iii) rank (/) = rank (u h» m) = rank (m h» zt). 

12.28. Show that congruence of matrices is an equivalence relation, i.e. (i) A is congruent to A; (ii) if A 
is congruent to B, then B is congruent to A; (iii) if A is congruent to B and f? is congruent to C, 
then A is congruent to C. 



SYMMETRIC BILINEAR FORMS. QUADRATIC FORMS 

12.29. Find the symmetric matrix belonging to each of the following quadratic polynomials: 

(i) V, z) = 2x 2 - 8xy + y 2 - 16xz + Uyz + 5z 2 

(ii) q(x, y, z) = x 2 — xz + y 2 

(iii) q(x, y, z) — xy + y 2 + 4xz + z 2 

(iv) q(x, y, z) = xy + yz. 
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12.30. For each of the following matrices A, find a nonsingular matrix P such that P t AP is diagonal: 



(i) A = 




In each case find the rank and signature. 

12.31. Let q be the quadratic form associated with a symmetric bilinear form /. Verify the following 
alternate polar form of /: f(u,v) = ±[q(u + v) — q(u — v)]. 

12.32. Let S(V) be the set of symmetric bilinear forms on V. Show that: 

(i) S(V) is a subspace of B(V); (ii) if dim V = n, then dimS(F) = An(n + 1). 



12.33. Let / be the symmetric bilinear form associated with the real quadratic form q(x,y) = 
ax 2 + bxy + ey 2 . Show that: 

(i) / is nondegenerate if and only if b 2 — 4ac # 0; 

(ii) / is positive definite if and only if a > 0 and b 2 — 4ac < 0. 

12.34. Suppose A is a real symmetric positive definite matrix. Show that there exists a nonsingular matrix 
P such that A = P*P. 

n 

12.35. Consider a real quadratic polynomial q(x v . . . , x n ) = { 2 ^ «tj*i*j, where a {j = a j{ . 

(i) If a u # 0, show that the substitution 

*1 - J/i — (Oi2J/2+ ••" +«ln2/n). x 2 = Vi, ••■> %n = Vn 

a n 

yields the equation q(x u ...,«„) = a n yl + q'(y 2 , ■■ -,y n ), where q' is also a quadratic 
polynomial. 

(ii) If a n = 0 but, say, a n ¥> 0, show that the substitution 

^1 = 2/1 + V2> %2 = Vl — Vi> x 3 = 3/3. • • • ' x n - Vn 

yields the equation q(x u ...,x n ) = 2 h^i' where 6 " °' i-e. reduces this case to case (i). 
This method of diagonalizing q is known as "completing the square". 

12.36. Use steps of the type in the preceding problem to reduce each quadratic polynomial in Problem 
12.29 to diagonal form. Find the rank and signature in each case. 



HERMITIAN FORMS 

12.37. For any complex matrices A,B and any k G C, show that: 

(i) A+B = A + B, (ii) kA = kA, (iii) AB = A B, (iv) A* = A*. 

12.38. For each of the following Hermitian matrices H, find a nonsingular matrix P such that P* H P is 
diagonal: 

c> »-(.! 0' (i " '■(«-« 2 -')' mB ' 

Find the rank and signature in each case. 

12.39. Let A be any complex nonsingular matrix. Show that H = A* A is Hermitian and positive 
definite. 

12.40. We say that B is Hermitian congruent to A if there exists a nonsingular matrix Q such that 
B = Q* A Q. Show that Hermitian congruence is an equivalence relation. 
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12.41. Prove Theorem 12.7: Let / be a Hermitian form on V. Then there exists a basis {e u ...,e n } of V 
in which / is represented by a diagonal matrix, i.e. f(e it e 3 ) = 0 for i # j. Moreover, every diagonal 
representation of / has the same number P of positive entries and the same number N of negative 
entries. (Note that the second part of the theorem does not hold for complex symmetric bilinear 
forms, as seen by Problem 12.12(h). However, the proof of Theorem 12.5 in Problem 12.13 does 
carry over to the Hermitian case.) 



MISCELLANEOUS PROBLEMS 

12.42. Consider the following elementary row operations: 

R { R jt [a 2 ] ftj -* kR if k # 0, [a 3 ] R f -> kR } + R { 
The corresponding elementary column operations are, respectively, 

[6J Ci^q, [b 2 ] C ( -* kC it k * 0, [b 3 ] C^kCj + Ci 
If K is the complex field C, then the corresponding Hermitian column operations are, respectively, 

[ej Ci-^Ci, [c 2 ] Ci^feCj, k^O, [c 3 ] Ci^kCj + Ci 

(i) Show that the elementary matrix corresponding to [6 J is the transpose of the elementary 
matrix corresponding to [a t ]. 

(ii) Show that the elementary matrix corresponding to [cj is the conjugate transpose of the ele- 
mentary matrix corresponding to [aj. 

12.43. Let V and W be vector spaces over K. A mapping /: VXW-+K is called a bilinear form on V 
and W if: 

(i) f(av t + bv 2 , w) = af(v v w) + bf(v 2 , w) 

(ii) f(v, aw t + bw 2 ) = af(v, w t ) + bf(v, w 2 ) 

for every a, b e K, v { e V, w } e W. Prove the following: 

(i) The set B(V, W) of bilinear forms on V and If is a subspace of the vector space of functions 
from VXW into K. 

(ii) If </> m } is a basis of V* and ,..,<,„} is a basis of W*, then {f fj : i=l,...,m, 
j = l,...,n} is a basis of B(V, W) where / {j is defined by f i} (v, w) - </> t {v) oAw). Thus 
dim B(V, W) = dim V • dim W. 

(Remark. Observe that if V = W, then we obtain the space B(V) investigated in this chapter.) 

m times 
i * \ 

12.44. Let V be a vector space over K. A mapping f: VXVX---xV-*K is called a multilinear (or: 
m-linear) form on V if / is linear in each variable, i.e. for i = 1, . . ,,m, 

/(..., alT+bZ, ...) = a /(..., m, ...) + bf(...,v, ...) 

where denotes the ith component, and other components are held fixed. An «i-linear form / is 
said to be alternating if 

. . . , v m ) = 0 whenever v i = V k , i¥* k 

Prove: 

(i) The set B m (V) of m-linear forms on V is a subspace of the vector space of functions from 
V X V X • • • XV into K. 

(ii) The set A m (V) of alternating m-linear forms on V is a subspace of B m (V). 
Remark 1. If m = 2, then we obtain the space B{V) investigated in this chapter. 

Remark 2. If V = K m , then the determinant function is a particular alternating m-linear form on V. 
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Answers to Supplementary Problems 



12.21. (i) Yes (ii) No (iii) Yes (iv) No (v) No (vi) Yes 



12.22 



• (i) A = ( 4 7 J) (ii) B = ( 2 J 3 ^) (iii) P = ( 



3 5 
-2 -2 



12.23. (ii) 




2 -4 -8\ / 1 0 -i\ /0 A 2\ /0 i 0' 

12.29. (i) ( -4 1 7 (ii) 0 10 (iii) U 1 0 (iv) U 0 £ 
1-8 7 5/ \-i 0 0/ \2 0 1/ \0 i 0 y 



12.30. (i) P = (J - 2 3 ), P'AP = (J _ 2 °), 



S = 0. 



(ii) p = 



(iii) p = 




P»AP = 





0 


0 






0 


2 


0 


)■ 


s = 


0 


0 


-38, 








fl 


0 


0 


°\ 




0 


1 


0 


0 




0 


0 


-7 


0 




\o 


0 


0 


469/ 



S = 2. 



12.38. (i) P = 

(ii) P = 




i 


-8 + i ' 




<1 


0 




1 




, P'HP = 


0 


1 




0 


! i 






0 
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Inner Product Spaces 

INTRODUCTION 

The definition of a vector space V involves an arbitrary field K. In this chapter we 
restrict K to be either the real field R or the complex field C. In the first case we call V a 
real vector space, and in the second case a complex vector space. 

Recall that the concepts of "length" and "orthogonality" did not appear in the investiga- 
tion of arbitrary vector spaces (although they did appear in Chapter 1 on the spaces R" and 
C n ). In this chapter we place an additional structure on a vector space V to obtain an 
inner product space, and in this context these concepts are defined. 

We emphasize that V shall denote a vector space of finite dimension unless otherwise 
stated or implied. In fact, many of the theorems in this chapter are not valid for spaces of 
infinite dimension. This is illustrated by some of the examples and problems. 

INNER PRODUCT SPACES 

We begin with a definition. 

Definition: Let V be a (real or complex) vector space over K. Suppose to each pair of 
vectors u, v G V there is assigned a scalar (u, v) € K. This mapping is called 
an inner product in V if it satisfies the following axioms: 

[h] (aui + bu 2 , v) = a(u u v) + b{u 2 , v) 

[h] (u,v) = (vTu) 

[h] {u, u) ^ 0; and {u, u) - 0 if and only if u = 0. 

The vector space V with an inner product is called an inner product space. 

Observe that (u,u) is always real by [h], and so the inequality relation in [h] makes 
sense. We also use the notation 

||U|| = V<M,M> 

This nonnegative real number \\u\\ is called the norm or length of u. Also, using [h] and 
[7 2 ] we obtain (Problem 13.1) the relation 

(u, avi + bv 2 ) = d{u, Vi) + b(u, v 2 ) 

If the base field K is real, the conjugate signs appearing above and in [h] may be ignored. 

In the language of the preceding chapter, an inner product is a positive definite sym- 
metric bilinear form if the base field is real, and is a positive definite Hermitian form if the 
base field is complex. 

A real inner product space is sometimes called a Euclidean space, and a complex inner 
product space is sometimes called a unitary space. 
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Example 13.1 : Consider the dot product in R": 

u • v = a t b 1 + a 2 b 2 + • • • + a n b n 

where u — (a s ) and v — (6j). This is an inner product on R n , and R n with this 
inner product is usually referred to as Euclidean n-spa-ce. Although there are 
many different ways to define an inner product on R" (see Problem 13.2), we shall 
assume this inner product on R" unless otherwise stated or implied. 

Example 13.2: Consider the dot product on C": 

U • V = ZjWj + Z 2 W 2 + ■ • • + Z n ™n 

where it = (zj) and v = (w t ). As in the real case, this is an inner product on C" 
and we shall assume this inner product on C n unless otherwise stated or implied. 

Example 13.3: Let V denote the vector space ofmXii matrices over R. The following is an inner 
product in V: 

(A,B) = tr(B'A) 
where tr stands for trace, the sum of the diagonal elements. 

Analogously, if U denotes the vector space of mXn matrices over C, then the 
following is an inner product in U: 

(A,B) = tr(B*A) 
As usual, B* denotes the conjugate transpose of the matrix B. 

Example 13.4: Let V be the vector space of real continuous functions on the interval a — t — b. 
Then the following is an inner product on V: 

if. 9) = Cf(t)s(t)dt 

Analogously, if U denotes the vector space of complex continuous functions on the 
(real) interval a ^ t - b, then the following is an inner product on 17: 



if, 9) = j f(t)g(t)dt 



Example 13.5: Let V be the vector space of infinite sequences of real numbers (a v a 2 , . . .) satisfying 

2 o? = a\+ a\+ ■•• < » 
»=i 

i.e. the sum converges. Addition and scalar multiplication are defined component- 
wise: 

{a u a 2 , . . .) + (&i,&2» • • ■) = (a i + b i ,a 2 +b 2 ,...) 

k(a v a 2 , . . . ) = (fca-!, ka 2 , . . . ) 

An inner product is defined in V by 

<(a 1( 02» •••). (h>h. •••)> = «i&i + a 2 b 2 + ■ ■ ■ 

The above sum converges absolutely for any pair of points in V (Problem 13.44); 
hence the inner product is well defined. This inner product space is called J 2 -space 
(or: Hilbert space). 



Remark 1: If \\v\\ = 1, i.e. if (v,v) = 1, then v is called a unit vector or is said to be 
normalized. We note that every nonzero vector u G V can be normalized by 
setting v = u/\\u\\. 

Remark 2: The nonnegative real number d(u,v) = ||» — is called the distance between 
u and v; this function does satisfy the axioms of a metric space (see Problem 
13.51). 
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CAUCHY-SCHWARZ INEQUALITY 

The following formula, called the Cauchy-Schwarz inequality, is used in many branches 
of mathematics. 

Theorem 13.1: (Cauchy-Schwarz): For any vectors u,vGV, 

\(u,v)\ ^ hi hi 

Next we examine this inequality in specific cases. 

Example 13.6: Consider any complex numbers a lt . . .,a n , 6 1; . . ., b n G C. Then by the Cauchy- 
Schwarz inequality, 

(«!&!+••• +a n b n ) 2 ~ <KI 2 + •■• +KI 2 )(|6il 2 + ••• + I&J 2 ) 

that is, ^ ||m|| 2 IMI 2 

where u — (a ( ) and v = (6j). 

Example 13.7: Let / and g be any real continuous functions defined on the unit interval 0 — t — 1. 
Then by the Cauchy-Schwarz inequality, 

«/,ff» 2 = (j^ f(t)g(t) (ft)* * j%Ht)dt j\Ht)dt = ||/|[ 2 ||ff|P 

Here V is the inner product space of Example 13.4. 

ORTHOGONALITY 

Let V be an inner product space. The vectors u, v G V are said to be orthogonal if 
(u>v) = 0. The relation is clearly symmetric; that is, if u is orthogonal to v, then (v, u) = 
{u, v) = 0 = 0 and so v is orthogonal to u. We note that 0 G V is orthogonal to every 

v G V for , 

<0,v> = (0v,v) = 0(v,v) = 0 

Conversely, if u is orthogonal to every v G V, then (u,u) = 0 and hence a = 0 by \h). 

Now suppose TV is any subset of V. The orthogonal complement of TV, denoted by TV 1 
(read "TV perp") consists of those vectors in V which are orthogonal to every w GW: 

W 1 - = {v G V : {v, w) = 0 for every w G TV} 

We show that TV X is a subspace of V. Clearly, 0 G TV 1 . Now suppose u,v G TV X . Then 
for any a,b G K and any w GW, 

{au + bv, w) = a(u, w) + «;> = a • 0 + b • 0 = 0 
Thus au + bv G TV 1 and therefore TV is a subspace of V. 

Theorem 13.2: Let TV be a subspace of V. Then V is the direct sum of TV and TV X , i.e. 
V = TV © TV X . 

Now if TV is a subspace of V, then V = W ® W x by the above theorem; hence there is 
a unique projection E w : V -» 7 with image TV and kernel 
TV 1 . That is, if v GV and v = w + w', where w GW, 
w' G TV X , then E w is defined by E w {v) — w. This mapping 
Ew is called the orthogonal projection of V onto TV. 



Example 13.8: Let W be the z axis in R 3 , i.e. 

W = {(0,0,c): cGR} 
Then is the a;j/ plane, i.e. 



W 



W X = {(a, b, 0) : a, b e R} / / i 



As noted previously, R 3 = W © PF 1 . The 
orthogonal projection E of R 3 onto W is given 
by E(x,y,z) = (0,0, z). 
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Example 13.9: Consider a homogeneous system of linear equations over R: 

a u xi + a l2 x 2 + • • • + a ln x n = 0 

02!^! + 022*2 + " ' - + a 2n%n = 0 



a ml*l + «m2*2 + 

or in matrix notation AX = 0. Recall that the solution space W may be viewed 
as the kernel of the linear operator A. We may also view W as the set of all vectors 
v = (x 1( . . ., «„) which are orthogonal to each row of A. Thus W is the orthogonal 
complement of the row space of A. Theorem 13.2 then gives another proof of the 
fundamental result: dim W = n — rank (A). 

Remark: If V is a real inner product space, then the angle 6 between nonzero vectors 
u, v E V is denned by 

(u, v) 

cos 0 = 



HIIMI 



By the Cauchy-Schwarz inequality, -1 ^ cos B ^ 1 and so the angle 6 always 
exists. Observe that u and v are orthogonal if and only if they are "perpendicu- 
lar", i.e. 0 = tt/2. 



ORTHONORMAL SETS 

A set {ui) of vectors in V is said to be orthogonal if its distinct elements are orthogonal, 
i.e. if {Ui, Uj) = 0 for i j. In particular, the set is said to be orthonormal if it is 
orthogonal and if each Ui has length 1, that is, if 

fO for i¥= j 
{Ui,Uj} = Sn = < 

[1 for i = j 

An orthonormal set can always be obtained from an orthogonal set of nonzero vectors by 
normalizing each vector. 

Example 13.10: Consider the usual basis of Euclidean 3-space R 3 : 

{ ei = (1, 0, 0), e 2 = (0, 1, 0), e 3 = (0, 0, 1)} 

It is clear that 

(e„ e t > = (e 2 , e 2 ) = <e 3 , e 3 > = 1 and <e i( e } ) = 0 for i ¥= j 

That is, {e u e 2 , e 3 } is an orthonormal basis of R 3 . More generally, the usual basis 
of R" or of C" is orthonormal for every n. 

Example 13.11: Let V be the vector space of real continuous functions on the interval —w-t-w 
with inner product denned by </,£>= f f(t)g(t)dt. The following is a classi- 

cal example of an orthogonal subset of V: 

{1, cos t, cos 2t, .... sin t, sin 2t, . . .} 
The above orthogonal set plays a fundamental role in the theory of Fourier series. 

The following properties of an orthonormal set will be used in the next section. 
Lemma 13.3: An orthonormal set {u u . . .,Ur} is linearly independent and, for any v € V, 
the vector w = v ( VtUl ) Ul - {v,u2)u 2 - • • • - {v,u r )u r 
is orthogonal to each of the im. 
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GRAM-SCHMIDT ORTHOGONALIZATION PROCESS 

Orthonormal bases play an important role in inner product spaces. The next theorem 
shows that such a basis always exists; its proof uses the celebrated Gram-Schmidt orthog- 
onalization process. 

Theorem 13.4: Let {v h . . .,v n } be an arbitrary basis of an inner product space V. Then 
there exists an orthonormal basis {u\, . . . , tin) of V such that the transition 
matrix from {v { } to {%} is triangular; that is, for i = 1, . . . , n, 

Ui = <hiVi + OiiVz + ■ • • + anVi 

Proof. We set Ui — Vi/||ih||; then {u x } is orthonormal. We next set 
w 2 = v 2 — {v 2 , and u 2 = w 2 /\\w 2 \\ 

By Lemma 13.3, w 2 (and hence u 2 ) is orthogonal to u\\ then {ui,ve} is orthonormal. We next 



w 3 = v 3 — (Vi, iii)ui — (vz, u 2 )u 2 and «s = tt>8/||w>s|| 

Again, by Lemma 13.3, Ws (and hence u 3 ) is orthogonal to ui and u 2 ; then {«i, Uz, u s ) is ortho- 
normal. In general, after obtaining {tti, . . . , im) we set 



(Note that w i+ i^0 because v i+ i & L(v u -,Vi) — L(u h .. .,%).) As above, {ui, . . .,u l+ i} 
is also orthonormal. By induction we obtain an orthonormal set {ui, . . . , Un) which is in- 
dependent and hence a basis of V. The specific construction guarantees that the transition 
matrix is indeed triangular. 

Example 13.12: Consider the following basis of Euclidean space B 3 : 

{v t = (1, 1, 1), v 2 = (0, 1, 1), v s = (0, 0, 1)} 

We use the Gram-Schmidt orthogonalization process to transform into an ortho- 
normal basis {u^. First we normalize v lt i.e. we set 



set 



Wi + i = Vi+i — (V i+u Ui)Ui 



(v i+ i,Ui)Ui and v*+i = 



Next we set 





and then we normalize w 2 , i.e. we set 

w 2 



2 11 
76' Vg'Vg 




IKIl 



Finally we set 

w 3 = v 3 — (v&u^ - (v 3 ,u 2 )u 2 



= (0,0,1) - 



J_/ J_ _1_ J_ 



J_/__2_ J_ JL_ 
V6\ V$ ' V$ 



) 




1 1 
2'2 



and then we normalize w 3 : 




The required orthonormal basis of R 3 is 



LINEAR FUNCTIONALS AND ADJOINT OPERATORS 

Let V be an inner product space. Each u € V determines a mapping u : V ■* K defined by 

u(v) — {v,u) 
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Now for any a, b G K and any vi, v 2 G V, 

u(av 1 + bv 2 ) - {avi + bv 2 ,u) = a(vi, u) + b(v 2 , u) = au(vi) + bu(v 2 ) 

That is, u is a linear functional on V. The converse is also true for spaces of finite dimen- 
sion and is an important theorem. Namely, 

Theorem 13.5: Let </> be a linear functional on a finite dimensional inner product space V. 

Then there exists a unique vector u G V such that <j>(v) = (v, u) for every 
vGV. 

We remark that the above theorem is not valid for spaces of infinite dimension (Problem 
13.45), although some general results in this direction are known. (One such famous result 
is the Riesz representation theorem.) 

We use the above theorem to prove 

Theorem 13.6: Let T be a linear operator on a finite dimensional inner product space V. 
Then there exists a unique linear operator T* on V such that 

(T{u),v) = (u,T*(v)) 

for every u, v G V. Moreover, if A is the matrix of T relative to an 
orthonormal basis {&} of V, then the conjugate transpose A* of A is the 
matrix of T* in the basis {d}. 

We emphasize that no such simple relationship exists between the matrices representing 
T and T* if the basis is not orthonormal. Thus we see one useful property of orthonormal 
bases. 

Definition: A linear operator T on an inner product space V is said to have an adjoint 
operator T* on V if (T{u),v) = (u,T*{v)) for every u, v G V. 

Thus Theorem 13.6 states that every operator T has an adjoint if V has finite dimension. 
This theorem is not valid if V has infinite dimension (Problem 13.78). 

Example 13.13: Let T be the linear operator on C 3 defined by 

T(x, y, z) = (2x + iy,y- Uz, x + (1 - i)y + Zz) 
We find a similar formula for the adjoint T* of T. Note (Problem 7.3) that the 
matrix of T in the usual basis of C 3 is 

/2 i 0 
[T] = 0 1 -5i 
\1 1-i 3 

Recall that the usual basis is orthonormal. Thus by Theorem 13.6, the matrix of T* 
in this basis is the conjugate transpose of [T\: 

IT*] = 

Accordingly, 

T*(x,y,z) = (2x + z,-ix + y + (l + i)z,5iy + Bz) 

The following theorem summarizes some of the properties of the adjoint. 

Theorem 13.7: Let S and T be linear operators on V and let k G K. Then: 

(i) (S + T)* = S* + T* (iii) (ST)* = T*S* 

(ii) (kT)* = kT* (iv) (T*)* = T 
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ANALOGY BETWEEN A(V) AND C, SPECIAL OPERATORS 

Let A(V) denote the algebra of all linear operators on a finite dimensional inner product 
space V. The adjoint mapping T h» T* on A(V) is quite analogous to the conjugation map- 
ping z z on the complex field C. To illustrate this analogy we identify in the following 
table certain classes of operators T G A(V) whose behavior under the adjoint map imitates 
the behavior under conjugation of familiar classes of complex numbers. 



Class of 
complex numbers 


Behavior under 
conjugation 


Class of 
operators in A(V) 


Behavior under 
the adjoint map 


Unit circle (|z| = 1) 


z = 1/z 


Orthogonal operators (real case) 
Unitary operators (complex case) 


t* = r _i 


Real axis 


z = z 


Self-adjoint operators 

Also called: 

symmetric (real case) 
Hermitian (complex case) 


r* = t 


Imaginary axis 


z — — z 


Skew-adjoint operators 

Also called: 

skew-symmetric (real case) 
skew-Hermitian (complex case) 


T* = -T 


Positive half axis 

(0,00) 


z = ww, w ¥* 0 


Positive definite operators 


T = S*S 
with £ nonsingular 



The analogy between these classes of operators T and complex numbers z is reflected in 
the following theorem. 

Theorem 13.8: Let A be an eigenvalue of a linear operator T on V. 

(i) If T* = T~\ then |A| = 1. 

(ii) If T* = T, then A is real. 

(iii) If T* = — T, then A is pure imaginary. 

(iv) If T = S*S with S nonsingular, then A is real and positive. 

We now prove the above theorem. In each case let v be a nonzero eigenvector of T 
belonging to A, that is, T(v) - to with v ¥° 0; hence (v, v) is positive. 

Proof of (i): We show that Xk(v, v) - (v,v): 

\\(v,v) = (to, to) - (T(v),T(v)) = (v,T*T(v)) = (v,I{v)) = (v,v) 

But (v, v) ¥> 0; hence AA = 1 and so |A| = 1. 

Proof of (ii): We show that \{v,v) = \{v,v): 

k(v,v) = (kv,v) = (T(v),v) = (v,T*(v)) = (v,T{v)) = (v,to) = X(v,v) 

But (v, v) ¥> 0; hence A = A and so A is real. 

Proof of (iii): We show that \{v, v) = — \(v,v): 

X(v,v) = (\v,v) = (T{v),v) = (v,T*(v)) = (v,-T(v)) = (v, -to) = -X(v,v) 

But (v,v) ¥- 0; hence A = - A or 1 = -A, and so A is pure imaginary. 
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Proof of (iv): Note first that S(v) ¥° 0 because S is nonsingular; hence (S{v),S(v)) is 
positive. We show that A(v, v) = (S(t;),iS(v)>: 

A<v,v> - (Xv, v) = <T(t>),t;) = (S*S(v),v) = <S(v),S(i;)> 

But <«,i;> and (S(v),S(v)) are positive; hence A is positive. 

We remark that all the above operators T commute with their adjoint, that is, 
TT* - T*T. Such operators are called normal operators. 



ORTHOGONAL AND UNITARY OPERATORS 

Let U be a linear operator on a finite dimensional inner product space V. As defined 
above, if 

U* = U~ l or equivalently UU* — U*U — I 

then U is said to be orthogonal or unitary according as the underlying field is real or com- 
plex. The next theorem gives alternate characterizations of these operators. 

Theorem 13.9: The following conditions on an operator U are equivalent: 

(i) U* = U- 1 , that is, UU* = U*U = I. 

(ii) U preserves inner products, i.e. for every v, w G V, 

(U(v),U(w)) = (v,w) 

(iii) U preserves lengths, i.e. for every v G V, \\U(v)\\ = \\v\\. 



Example 13.14: Let T : R 3 -* R 3 be the linear operator which 
rotates each vector about the z axis by a fixed 
angle 6: 

T(x, y, z) = (x cos 0 — y sin *, 

* sin 9 + y cos », z) 

Observe that lengths (distances from the ori- 
gin) are preserved under T. Thus T is an 
orthogonal operator. 




Example 13.15: Let V be the J 2 -space 0 f Example 13.5. Let T : V -* V be the linear operator de- 
fined by T(a l ,a i ,...) = (0,a l ,a 2 ,...). Clearly, T preserves inner products and 
lengths. However, T is not surjective since, for example, (1,0,0, .. .) does not belong 
to the image of T; hence T is not invertible. Thus we see that Theorem 13.9 is not 
valid for spaces of infinite dimension. 

An isomorphism from one inner product space into another is a bijective mapping 
which preserves the three basic operations of an inner product space: vector addition, 
scalar multiplication, and inner products. Thus the above mappings (orthogonal and 
unitary) may also be characterized as the isomorphisms of V into itself. Note that such a 
mapping U also preserves distances, since 

\\U(v)-U(w)\\ = \\U(v-w)\\ = \\v-w\\ 
and so U is also called an isometry. 



ORTHOGONAL AND UNITARY MATRICES 

Let U be a linear operator on an inner product space V. By Theorem 13.6 we obtain the 
following result when the base field K is complex. 

Theorem 13.10A: A matrix A with complex entries represents a unitary operator U 
(relative to an orthonormal basis) if and only if A* = A -1 . 

On the other hand, if the base field K is real then A* = A*; hence we have the follow- 
ing corresponding theorem for real inner product spaces. 
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Theorem 13.10B: A matrix A with real entries represents an orthogonal operator U 
(relative to an orthonormal basis) if and only if A* = A -1 . 

The above theorems motivate the following definitions. 

Definition: A complex matrix A for which A* = A -1 , or equivalently A A* = A* A - I, 
is called a unitary matrix. 

Definition: A real matrix A for which A t — A~ 1 , or equivalently AA* = A* A = I, is 
called an orthogonal matrix. 

Observe that a unitary matrix with real entries is orthogonal. 

Example 13.16: Suppose A = ( " 2 ) is a unitary matrix. Then AA* = 7 and hence 



6, b 



AA „ = /a, aA/di IA _ /|oi| 2 +N 2 a.h + c^bA _ l\ 0 \ _ f 



bi b 2 l\d2 b 2 J \a 1 b 1 + d 2 b 2 |&il 2 +|&2l 2 / \0 1 
Thus 

Kl 2 + H 2 = 1. I&il 2 + N 2 = 1 and + a 2 b 2 = 0 

Accordingly, the rows of A form an orthonormal set. Similarly, A* A = I forces 
the columns of A to form an orthonormal set. 

The result in the above example holds true in general; namely, 

Theorem 13.11: The following conditions for a matrix A are equivalent: 

(i) A is unitary (orthogonal). 

(ii) The rows of A form an orthonormal set. 

(iii) The columns of A form an orthonormal set. 

Example 13.17: The matrix A representing the rotation T in Example 13.14 relative to the usual 
basis of R 3 is 

(cos e — sin e 0 
sin 8 cos 9 0 
0 0 1, 

As expected, the rows and the columns of A each form an orthonormal set; that is, 
A is an orthogonal matrix. 



CHANGE OF ORTHONORMAL BASIS 

In view of the special role of orthonormal bases in the theory of inner product spaces, 
we are naturally interested in the properties of the transition matrix from one such basis 
to another. The following theorem applies. 

Theorem 13.12: Let {e ir . . .,e n } be an orthonormal basis of an inner product space V. 

Then the transition matrix from {ei} into another orthonormal basis is 
unitary (orthogonal). Conversely, if P = (ay) is a unitary (orthogonal) 
matrix, then the following is an orthonormal basis: 

{ei = auei + a M e 2 + • • • + a^en : i = 1, . . . , n} 

Recall that matrices A and B representing the same linear operator T are similar, i.e. 
B — P _1 AP where P is the (nonsingular) transition matrix. On the other hand, if V is 
an inner product space, we are usually interested in the case when P is unitary (or orthog- 
onal) as suggested by the above theorem. (Recall that P is unitary if P* = P~\ and P is 
orthogonal if P* = P~\) This leads to the following definition. 
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Definition: Complex matrices A and B are unitarily equivalent if there is a unitary matrix 
P for which B = P*AP. Analogously, real matrices A and B are orthogonally 
equivalent if there is an orthogonal matrix P for which B = P*AP. 

Observe that orthogonally equivalent matrices are necessarily congruent (see page 262). 



POSITIVE OPERATORS 

Let P be a linear operator on an inner product space V. P is said to be positive (or: 

semi-definite) if _ . „ 

P - S*S for some operator S 

and is said to be positive definite if S is also nonsingular. The next theorems give alternate 
characterizations of these operators. 

Theorem 13.13 A: The following conditions on an operator P are equivalent: 

(i) P = T 2 for some self -ad joint operator T. 

(ii) P = S*S for some operator S. 

(iii) P is self -adjoint and (P{u),u) ^ 0 for every u £ V. 

The corresponding theorem for positive definite operators is 

Theorem 13.13B: The following conditions on an operator P are equivalent: 

(i) P = T i for some nonsingular self -ad joint operator T. 

(ii) P = S*S for some nonsingular operator S. 

(iii) P is self -adjoint and (P{u),u) > 0 for every u ¥• 0 in V. 



DIAGONALIZATION AND CANONICAL FORMS IN EUCLIDEAN SPACES 

Let T be a linear operator on a finite dimensional inner product space V over K. Rep- 
resenting T by a diagonal matrix depends upon the eigenvectors and eigenvalues of T, 
and hence upon the roots of the characteristic polynomial A(t) of T (Theorem 9.6). Now 
A(t) always factors into linear polynomials over the complex field C, but may not have any 
linear polynomials over the real field R. Thus the situation for Euclidean spaces (where 
K = K) is inherently different than that for unitary spaces (where K = C); hence we treat 
them separately. We investigate Euclidean spaces below, and unitary spaces m the next 



section. 



Theorem 13.14: Let T be a symmetric (self-adjoint) operator on a real finite dimensional 
inner product space V. Then there exists an orthonormal basis of V 
consisting of eigenvectors of T; that is, T can be represented by a 
diagonal matrix relative to an orthonormal basis. 



We give the corresponding statement for matrices. 

Alternate Form of Theorem 13.14: Let A be a real symmetric matrix. Then there exists 

an orthogonal matrix P such that B = p-^AP = P*AP 
is diagonal. 

We can choose the columns of the above matrix P to be normalized orthogonal eigen- 
vectors of A; then the diagonal entries of B are the corresponding eigenvalues. 
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/ 2 -2\ 

Example 13.18: Let A = I J . We find an orthogonal matrix P such that P l AP is diagonal. 



The characteristic polynomial A(t) of A is 

t - 2 2 



A(t) = \tI-A\ = 



2 £ - 5 



(t-6)(t-l) 



The eigenvalues of A are 6 and 1. Substitute t = 6 into the matrix tl — A to 
obtain the corresponding homogeneous system of linear equations 

4x + 2j/ = 0, 2z + 1/ = 0 

A nonzero solution is v x = (1, —2). Next substitute t = 1 into the matrix tl — A 
to find the corresponding homogeneous system 

-x + 2y = 0, 2a; - 4j/ = 0 

A nonzero solution is (2, 1). As expected by Problem 13.31, v t and v 2 are orthogonal. 
Normalize and v 2 to obtain the orthonormal basis 

{m, = (1A/5, -2A/5), m 2 = (2/V5, l/>/5)} 

Finally let P be the matrix whose columns are u x and u 2 respectively. Then 

/ 1/VE 2/Vs\ /6 0 

P = and P-iAP = PtAP = ( 

V-2/V5 1/V5/ \° 1 

As expected, the diagonal entries of P*AP are the eigenvalues corresponding to the 
columns of P. 

We observe that the matrix B = P^AP = P*AP is also congruent to A. Now if q is 
a real quadratic form represented by the matrix A, then the above method can be used 
to diagonalize q under an orthogonal change of coordinates. This is illustrated in the 
next example. 

Example 13.19: Find an orthogonal transformation of coordinates which diagonalizes the quadratic 
form q(x, y) = 2x 2 — 4xy + 5y 2 . 

/ 2 — 2\ 

The symmetric matrix representing q is A = I ). In the preceding 

example we obtained the orthogonal matrix >■ — ' 

p / W5 2AM forwMch pt / 6 o 

^-2/^5 1/-/5/ ^° 1 

(Here 6 and 1 are the eigenvalues of A.) Thus the required orthogonal transforma- 
tion of coordinates is 

fx\ fx'\ x = x'/V5 + 2y'V5 

[ ) = P ( ,. , ) that is, ~ r_ 

Under this change of coordinates q is transformed into the diagonal form 

q(x',y>) = Gx' 2 + y' 2 

Note that the diagonal entries of q are the eigenvalues of A. 

An orthogonal operator T need not be symmetric, and so it may not be represented by 
a diagonal matrix relative to an orthonormal basis. However, such an operator T does have 
a simple canonical representation, as described in the next theorem. 

Theorem 13.15: Let T be an orthogonal operator on a real inner product space V. Then 
there is an orthonormal basis with respect to which T has the following 
form: 
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I cos 6 r — sin Or 
| sin 0 r cos Or 



The reader may recognize the above 2 by 2 diagonal blocks as representing rotations in 
the corresponding two-dimensional subspaces. 



DIAGONALIZATION AND CANONICAL FORMS IN UNITARY SPACES 

We now present the fundamental diagonalization theorem for complex inner product 
spaces, i.e. for unitary spaces. Recall that an operator T is said to be normal if it com- 
mutes with its adjoint, i.e. if TT* = T* T. Analogously, a complex matrix A is said to be 
normal if it commutes with its conjugate transpose, i.e. if AA* = A* A. 

Example 13.20: Let A = (\ „ , 1 „.V Then 

\i 3 + 2i j 

AA* = f 1 1 V 1 ~M = ( 2 3 " 3 ^ 

\i 3 + 2i/Vl 3 — 2i / V 3 + 3 ' 14 / 

AM = ( X ~* V 1 M = ( 2 3 ~ 3 ^ 
\1 8-2t/\* 3 + 2i/ V 3 + 3i I 4 / 

Thus A is a normal matrix. 

The following theorem applies. 

Theorem 13.16: Let T be a normal operator on a complex finite dimensional inner product 
space V. Then there exists an orthonormal basis of V consisting of 
eigenvectors of T; that is, T can be represented by a diagonal matrix 
relative to an orthonormal basis. 

We give the corresponding statement for matrices. 

Alternate Form of Theorem 13.16: Let A be a normal matrix. Then there exists a uni- 
tary matrix P such that B — P~ X AP — P*AP is diagonal. 

The next theorem shows that even non-normal operators on unitary spaces have a 
relatively simple form. 

Theorem 13.17: Let T be an arbitrary operator on a complex finite dimensional inner 
product space V. Then T can be represented by a triangular matrix 
relative to an orthonormal basis of V. 
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Alternate Form of Theorem 13.17: Let A be an arbitrary complex matrix. Then there 
exists a unitary matrix P such that B = P~ l AP = P*AP is triangular. 

SPECTRAL THEOREM 

The Spectral Theorem is a reformulation of the diagonalization Theorems 13.14 and 13.16. 

Theorem 13.18 (Spectral Theorem): Let T be a normal (symmetric) operator on a com- 
plex (real) finite dimensional inner product space V. Then there exist 
orthogonal projections E it . . .,E r on V and scalars Xi, . . .,X r such that 

(i) T = XlEi + x 2 E 2 + • • • + XrE r 

(ii) Ei + E 2 + ■ ■ ■ + E r = I 

(iii) EiEs = 0 for i ¥= j. 

The next example shows the relationship between a diagonal matrix representation and 
the corresponding orthogonal projections. 



Example 13.21 : Consider a diagonal matrix, say A = \ . Let 



E, = 



The reader can verify that the E { are projections, i.e. e\ — E { , and that 

(i) A = 2E t + 3E 2 + 5E S , (ii) E l +E 2 + E s = I, (iii) E^, = 0 for i ¥= j 













)• '.-( 









Solved Problems 

INNER PRODUCTS 

13.1. Verify the relation (u, av x + bv 2 ) = d{u, v{) + b(u, v 2 ). 

Using [I 2 ], [7j] and then [7 2 ], we find 



(u, + bv 2 ) = (ov, + &V.J, m> = a(v u u) + b(v 2 , u) 

= d(v v u) + b(v 2 ,u) = a^Vj) + b{u,v 2 ) 

13.2. Verify that the following is an inner product in R 2 : 

(u, v) = xiy! - xiy 2 - x 2 y x + 3x 2 y 2 , where u = (x u x 2 ), v = (yi, y 2 ). 

Method 1. 

We verify the three axioms of an inner product. Letting w = (a lt z 2 ), we find 
au + bw = a(x lt x 2 ) + b(z lt = (axi + 6z 1( ax 2 + bz 2 ) 
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Thus 

(au + bw, v) = ((aXi + bz lt ax 2 + 6*2). (Vi> V2)) 

= (ax! + bzjy! — (arc, + bz l )y 2 - (ax 2 + bz 2 )y 1 + 3(ax 2 + bz 2 )y 2 
= aixtf! - x x y 2 - x 2 V! + 3x 2 y 2 ) + &(«i2/i - «i3/ 2 ~ z 2Vi + 3z 2 y 2 ) 
= a(u, v) + b{w, v) 
and so axiom [/^ is satisfied. Also, 

(v,u) = y^x x - y x x 2 — y 2 x x + Zy 2 x 2 - x l y 1 - x x y 2 - x 2 y r + Sx 2 y 2 = (u,v) 
and axiom [7 2 ] is satisfied. Finally, 

(u,u) = x\ - 2x t x 2 + Zx\ - x\ - 2x x x 2 + x\ + 2x\ = {x t - x 2 ) 2 + 2x\ - 0 
Also, {u, u) — 0 if and only if x 1 = 0, x 2 — 0, i.e. u = 0. Hence the last axiom [I s ] is satisfied. 
Method 2. 

We argue via matrices. That is, we can write (u, v) in matrix notation: 
{ u,v) = u'Av = (x lt x 2 )(_\ 

and so [/j] holds. Since A is symmetric, [1 2 ] holds. Thus we need only show that A is positive 
definite. Applying the elementary row operation R 2 -» R l + R 2 and then the corresponding ele- 
mentary column operation C 2 -> C 1 + C 2 , we transform A into diagonal form (\ ^ V Thus A 
is positive definite and [7 3 ] holds. ^ ' 



13.3. Find the norm of v = (3, 4) G R 2 with respect to: 

(i) the usual inner product, (ii) the inner product in Problem 13.2. 

(i) = (v,v) = <(3,4), (3,4)) = 9 + 16 = 25; hence ||v|| = 5. 

(ii) = (v,v) = <(3,4), (3,4)> = 9 - 12 - 12 + 48 = 33; hence \\v\\ = y/33. 

13.4. Normalize each of the following vectors in Euclidean space R 3 : 

(i) u = (2,1,-1), (ii) v = (hh~D- 

(i) Note <m, u) is the sum of the squares of the entries of u; that is, (u, u) - 2 2 + l 2 + (— l) 2 = 6. 
Hence divide u by = y/{u, u) = V6 to obtain the required unit vector: 

ul\\u\\ = (2/yfl, 1/V6, -l/\/6) 

(ii) First multiply v by 12 to "clear" of fractions: 12v = (6, 8, -3). We have <12v, 12v) = 
6 2 + 8 2 + (— 3) 2 = 109. Then the required unit vector is 

12v/||12v|| = (6/VT09, 8/VI09, -3/\/Ib9 ) 



13.5. Let V be the vector space of polynomials with inner product given by (f,g) = 
£f(t)g(t)dt. Let f(t) = t + 2 and g(t) = « 2 - 2t - 3. Find (i) (f,g) and (ii) ||/||. 

(i) (f,9) = j\t + 2W ~ U ~ 3) dt = |^tV4 - 7t 2 /2 - 6t J = -37/4 

(ii) </,/> = f (t + 2)(t + 2)dt = 19/3 and ||/|| = vW> = VW3 
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13.6. Prove Theorem 13.1 (Cauchy-Schwarz): \(u,v)\ ^ \\u\\ \\v\\. 

If v = 0, the inequality reduces to 0 — 0 an d hen ce is valid. Now suppose v # 0. Using 
zz = \z\ 2 (for any complex number z) and (v,u) = (u,v), we expand \\u — (u, v)tv\\ 2 — 0 where t 
is any real value: 

0 — \\u — (u, v)tv\\ 2 = (u — (u,v)tv, u — (u,v)tv) 

= (u,u) — (u, v)t(u, v) — (u,v)t(v,u) + (u, v)(u, v)t 2 (v, V) 
= \\u\\ 2 - 2t\(u,v)\* + \(u,v)\n*\\v[\2 

Set t = l/\\v\\ 2 to find 0 \\u\\ 2 - ^jjf , from which |(u,«>P ^ ||m|| 2 ||«|| 2 . Taking the square 
root of both sides, we obtain the required inequality. 

13.7. Prove that the norm in an inner product space satisfies the following axioms: 
[Ni]: \\v\\ 0; and ||v|| = 0 if and only if v = 0. 

[N 2 \. \\kv\\ = |fe| |H|. 

[Ns]: \\u + v\\^\\u\\ + \\v\\. 

By [/ 3 ], (v, v) s= 0; hence \\v\\ = \/(v, v) - 0. Furthermore, = 0 if and only if (v, v) = 0, 
and this holds if and only if v = 0. Thus [A^] is valid. 

We find \\kv\\ 2 = (kv, kv) = kk(v,v) = \k\ 2 \\v\\ 2 . Taking the square root of both sides gives [N 2 ]- 
Using the Cauchy-Schwarz inequality, we obtain 

[ |zt -f- v| I 2 = {u + v, u + v) — (u,u) + (u,v) + (u, v) + (v,v) 

^ IMP + 2|HIIMI + IMP = (INI + IMI) 2 

Taking the square root of both sides yields [N s ]. 

Remark: [N s ] is frequently called the triangle inequality because 
if we view u + v as the side of the triangle formed with u and v 
(as illustrated on the right), then [N s ] states that the length of one 
side of a triangle is less than or equal to the sum of the lengths 
of the other two sides. 

ORTHOGONALITY 

13.8. Show that if u is orthogonal to v, then every scalar multiple of u is also orthogonal 
to v. Find a unit vector orthogonal to Vi = (1, 1, 2) and v 2 = (0, 1, 3) in R 3 . 

If <m, v) = 0 then (fcw, v) = k{u, v) = k • 0 = 0, as required. Let w — (a;, y, z). We want 

0 = (w,v{) = x + y + 2z and 0 = (w,v 2 ) = y + 3z 
Thus we obtain the homogeneous system 

x + y + 2z = 0, y + Sz = 0 
Set z — 1 to find y = — 3 and x = 1; then w = (1,-3,1). Normalize w to obtain the required 
unit vector w' orthogonal to v t and v 2 : w' = w/||w|| = (l/yTl, — 3/vTl, 1/Vli)- 

13.9. Let W be the subspace of R 5 spanned by u = (1, 2, 3, -1, 2) and v = (2, 4, 7, 2, -1). 
Find a basis of the orthogonal complement W 1 of W. 

We seek all vectors w = (x, y, z, s, t) such that 

(w, u) = x + 2y + 3z - s + 2t = 0 
<w, v) = 2x + 4y + 7z + 2s — t = 0 
Eliminating x from the second equation, we find the equivalent system 

x + 2y + 3z — s + 2t = 0 
z + 4s - 5t = 0 

The free variables are y, s and t. Set y = —1, s = 0, t = 0 to obtain the solution w t = (2, —1, 0, 0, 0). 
Set y = 0, s = 1, t = 0 to find the solution w 2 = (13,0,-4,1,0). Set y = 0, a = 0, t = 1 to obtain 
the solution w 3 = (—17,0,5,0,1). The set {w lt w 2 ,w 3 } is a basis of . 
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13.10. Find an orthonormal basis of the subspace W of C 3 spanned by Vi — (1, i, 0) and 
tf 2 = (l,2,l-i). 

Apply the Gram-Schmidt orthogonalization process. First normalize v v We find 

IKII 2 = <Vi»»i) = 1*1 + *■(-*) + 0*0 = 2 andso 1KH = V2 
Thus % = = t/\/2, 0). 

To form w 2 = v 2 — (v 2 , u t )u u first compute 

(v 2 , Ml > = <(1, 2, 1 - i), (1A/2, VV2, 0)> = 1/a/2 - 2i/V2 = (1 - 2t)/V2 

m, /•. o i 1 — 2t / 1 i . \ / 1 + 2i 2 — i , . \ 

W2 = (1,2.1-.)-— (^.^.0j = (-t_,— ,l- l ) 

Next normalize w 2 or, equivalently, 2w 2 = (1 + 2i, 2 — i, 2 — 2i). We have 

= <2i« 1 ,2w 1 ) = (1 + 2i)(l - 2i) + (2 - 1)(2 + i) + (2 - 2i)(2 + 2i) = 18 
and \\2wjW = VTlB. Thus the required orthonormal basis of W is 

( _/i i \ 2w, _ / i + 2t 2-t 2-2i\-l 

r - 1 vf v?t M2 " iw " 1 ' vis * vis Ji 

13.11. Prove Lemma 13.3: An orthonormal set {u u ...,«,} is linearly independent and, for 
any v GV, the vector 

W = V - {V,Ui)Ui - {V,U2)U 2 - • • • - (V,U r )Ur 

is orthogonal to each of the im. 

Suppose a 1 u 1 + ••• + a r u r = 0. Taking the inner product of both sides with respect to u u 
0 = <0, u{) = (OjM! + • • • + a r u r , «!> 

= »i<«i» M l> + a 2( u 2> «1> + • • • + a r < M r. M l> 

= a t • 1 + o 2 • 0 + • • ■ + a r • 0 = a t 
or a t = 0. Similarly, for i = 2, . . . , r, 

0 = <0, Mj) = H + a r « T , w { ) 

= 0 1 <M 1 ,M J > + ••• + Oi<Mj,Mi> + ••• + <M M r, M i> = °i 

Accordingly, {%, . . . , w r } is linearly independent. 

It remains to show that w is orthogonal to each of the w 4 . Taking the inner product of w with 
respect to u x , 

<«>,%> = (V,U{) - <f,Wl><«l,ttl> - <V, M 2 )<M 2 , Wj) - •■• - (V, U r )(U r , Mj) 

= (■y.Mj) - <?,«!> «1 - <v,m 2 )-0 - ••• - (v,u r )-0 = 0 
That is, w is orthogonal to %. Similarly, for i = 2, . . . ,r, 

<W,Mi) = <V,Mi) - <U,Ml)<Wl,Wi) - • • • - <V,M i )(M i ,M i ) - • • • - (-U , U r )(,U r , Mj) = 0 

Thus w is orthogonal to u t for i = 1, . . .,r, as claimed. 

13.12. Let W be a subspace of an inner product space V. Show that there is an orthonormal 
basis of W which is part of an orthonormal basis of V. 

We choose a basis {v u . . .,v r } of W and extend it to a basis {v lt ...,v n } of V. We then apply 
the Gram-Schmidt orthogonalization process to {v lt ...,v n } to obtain an orthonormal basis 
{ Ul M n } of V where, for i = 1, ...,», « s = + • ■ • + %« { . Thus W and there- 

fore {%, . . . , u r } is an orthonormal basis of W. 
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13.13. Prove Theorem 13.2: Let W be a subspace of V; then V - W © W x . 

By Problem 13.12 there exists an orthonormal basis {u u . . . , u r } of W which is part of an ortho- 
normal basis {w 1( . . .,u n } of V. Since {u u . . .,%} is orthonormal, u r+1 , ...,»„£ W 1 - . If v &V, 

v = OjMj + • • ■ + a n u n where + • • • + a r u r G W, a r+1 w r + 1 + • • • + a n u n & W 1 ' 

Accordingly, V = W +W ± . 

On the other hand, if wEWflW 1 , then (w,w) = 0. This yields w = 0; hence WriW X = {0}. 

The two conditions, V — W + W ± and WnW 1 = {0}, give the desired result V = W © W X . 

Note that we have proved the theorem only for the case that V has finite dimension; we remark 
that the theorem also holds for spaces of arbitrary dimension. 



13.14. Let W be a subspace of W. Show that W c W , and that W = W when V 
has finite dimension. 

Let wGW. Then (w, v) = 0 for every vGW L ; hence we.W LL . Accordingly, WcW ±X . 
Now suppose V has finite dimension. By Theorem 13.2, V = W © W 1 and, also, V = 
W X © W- 1 - 1 . Hence 

dim W = dim V - dim W" 1 and dim W 11 = dim V - dim W 1 - 
This yields dim W = dimW L± . But ^dr 11 by the above; hence W= W 11 , as required. 



13.15. Let {ei, . . . , e n } be an orthonormal basis of V. Prove: 

(i) for any u G V, u = (u, e 1 )e 1 + (u, e 2 )e2 +•••+<«, e n )e n ; 

(ii) (a-iei + • • • + a„e„, bid + • • • + b n e n ) = + 0262 + • • • + OnK; 

(iii) for any u,v G V, (u,v) = (u,ei){v7el) + • ■ ■ + <«, e„)<v, e«>; 

(iv) if T : V -» F is linear, then (T(ej), e f ) is the i?'-entry of the matrix A representing 
T in the given basis {ei}. 

(i) Suppose u = k 1 e 1 + k 2 e 2 + ■• • +k n e n . Taking the inner product of u with e u 

(u, ej) = < Vi + ^2«2 + 1" Ke n , ei> 

= fci<ei, e x > + k 2 (e 2 , erf + • • • + fc„<e n , 6l > 
= fej • 1 + fc 2 • 0 + • • • + k n • 0 = fci 

Similarly, for t = 2, . . . , n, 

<«, e { ) = < Vi + • • • + ftjej + • • • + fc B e„, 6 { > 

= fei<«i. e f > + • • • + k^, e t ) + ■■■ + fc n (e n , e 4 > 
= V<> + ••• + fcj-1 + ••• + k n -0 = k t 
Substituting <«, e 4 > for fcj in the equation u = k x e x + • • • + k n e n , we obtain the desired result. 



(ii) We have ( 2 2 ^i) = . 2 ,^ e,> 
But {e it e-) = 0 for i ¥> j, and <e t , e^) = 1 for i = j; hence, as required, 

<n n \ n 

{ 2 a»ei, 2 &j«y = .2 = Mi + a 2 *2 + • • • + a n b n 

(iii) By (i), w = (u, e 1 )e 1 +•••+(«, e„)e„ and v - (v, +•••+(«, e„>e„ 
Then by (ii), (m, d> = <m, «!><•», «!> + (m, e 2 ><«7i^ +•••+(«, e n )(v7<Q 
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(iv) By(i), 

T(e x ) = {Tie,), efa + (T( ei ), e 2 )e 2 + ■■■ + (T( ei ), e n )e n 
T(e 2 ) = (T{e 2 ), ei ) ei + (T(e 2 ),e 2 )e 2 + ■■■ + (T(e 2 ), e n )e n 



T(e n ) = (nej.efa + (T(e n ),e 2 )e 2 + • • • + {T(e n ),e n )e n 

The matrix A representing T in the basis {e t } is the transpose of the above matrix of co- 
efficients; hence the #-entry of A is (T(e } ), e$. 



ADJOINTS 

13.16. Let T be the linear operator on C 3 defined by 

T(x,y,z) = (2x + (l-i)y,(Z + 2i)x-4iz,2ix + (4-3i)y-3z) 
Find T*(x, y, z). 

First find the matrix A representing T in the usual basis of (see Problem 7.3): 

/ 2 1-i 0 \ 

A = j 3 + 2i 0 -4i 
\ 2i 4 - 3i -3 / 

Form the conjugate transpose A* of A: 

/ 2 3 - 2i -2i \ 

A* = 1 + i 0 4 + 3t 

\ 0 4i -3 / 

Thus 

T*(x, y, z) = (2* + (3 - 2i)y - 2iz, (1 + i)x + (4 + 3i)z, 4iy - Zz) 



13.17. Prove Theorem 13.5: Let ^ be a linear functional on a finite dimensional inner 
product space V. Then there exists a unique u 6 V such that <j>(v) = (v, u) for 
every v € V. 

Let {e u ...,«„} be an orthonormal basis of V. Set 

u = fiejei + <t>(e 2 )e 2 + • • • + </>(e n )e n 

Let u be the linear functional on V defined by u(v) = (v, u), for every v G V. Then for i = 1, . . . , n, 

M(e f ) = (e it u) = (e t , ^)e l +■■■+ ^)e n ) = <p{e x ) 

Since u and 0 agree on each basis vector, u = 0. 

Now suppose w' is another vector in V for which = (t>, w'> for every «eV. Then 

<d, u) = (v, u') or (v, u — u') = 0. In particular this is true for v = u - u' and so (u — u',u — u') = 0. 
This yields u — u' = 0 and u = «'. Thus such a vector u is unique as claimed. 



13.18. Prove Theorem 13.6: Let T be a linear operator on a finite dimensional inner product 
space V. Then there exists a unique linear operator T* on V such that (T(u), v) = 
(u, T* (v)), for every u,v <=V. Moreover, if A is the matrix representing T in an 
orthonormal basis {e ( } of V, then the conjugate transpose A* of A is the matrix rep- 
resenting T* in {e { }. 

We first define the mapping T*. Let * be an arbitrary but fixed element of V. The map 
u h» <r( M ), v) is a linear functional on V. Hence by Theorem 13.5 there exists a unique element 
v'&V such that (T(u), v) = (u, v') for every n€V. We define T*V-*V by T*(i>) = t>'. Then 
(T(u), v) = (it, T* (v)) for every w, t> G V. 
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We next show that T* is linear. For any u, v t G V, and any a,b €. K, 
(u, T*(a Vl + bv 2 )) = (T(u), av t + bv 2 ) = d{T(u), + b(T(u), v 2 ) 

= d(u, T*(vJ) + b(u, T*(v 2 )) = (u, a7 , *(v 1 ) + bT*(v 2 )) 
But this is true for every u G V; hence T*(av t + bv 2 ) = aT*(v{) + bT*(v 2 ). Thus T* is linear. 

By Problem 13.15(iv), the matrices A = (a tj ) and B = (by) representing T and T* respectively 
in the basis {e 4 } are given by a i} = (T(ej), e ; ) and b tj = (T* (e,), e 4 ). Hence 

6 sj = (T*( ej ), e { ) = (e u T*(e,)) = (T(e { ), ej ) = a~ { 
Thus B = A*, as claimed. 

13.19. Prove Theorem 13.7: Let S and T be linear operators on a finite dimensional inner 
product space V and let k G K. Then: 

(i) (S + T)* = S* + T* (iii) (ST)* = T*S* 

(ii) (kT)* = kT* (iv) (T*)* = T 

(i) For any u,v 6 V, 

<(S+r)(M),i» = (S(u) + T(m), v) = {S(u), v) + (T(u), v) = {u, S*(v)) + (u, T*M) 

= (u, S*(v) + T*(v)) = (u, (S* + T*)(v)) 
The uniqueness of the adjoint implies (S + T)* = S* + T*. 

(ii) For any u, v G V, 

{(kT)(u), v) = (kT(u), v) = k(T(u), v) = k(u, T*(v)) = (u, kT*(v)) = (u, (icT*)(v)) 

The uniqueness of the adjoint implies (kT)* = JcT*. 

(iii) For any u,v GV, 

i(ST)(u), v) = (S(T(u)), v) = (T(u), S*(v)) = (u, T*(S*(v))) = (u, (T*S*)(v)) 
The uniqueness of the adjoint implies (ST)* = T*S*. 

(iv) For any u.veV, (T*(u),v) = (v, T*(u)) = (T(v), u) = (u, T(v)) 
The uniqueness of the adjoint implies (T*)* = T. 

13.20. Show that: (i) /* = 7; (ii) 0* = 0; (iii) if T is invertible, then (T -1 )* = T*~K 

(i) For every u, v G V, (I(u), v) = (u, v) = (u, I(v)); hence /* = /. 

(ii) For every u,vGV, (0(u),v) = (0, v) = 0 = (u,0) = (u,0(v)); hence 0* = 0. 

(iii) 1 = 1* = (7T-i)* = (r-i)*r*; hence (T' 1 )* = T*~K 

13.21. Let T be a linear operator on V, and let W be a T-invariant subspace of V. Show 
that W 1 is invariant under T*. 

Let u G W ± . If w G W, then T(w) G W and so (w, T*(u)} = (T(w), u) = 0. Thus T*(u) G 
since it is orthogonal to every w G W. Hence W X is invariant under T*. 

13.22. Let Tbea linear operator on V. Show that each of the following conditions implies 
T = 0: 

(i) (T(u), v) = 0 for every u,v GV; 

(ii) V is a complex space, and (T(u),u) — 0 for every u G V; 

(iii) T is self -adjoint and (T(u),u) = 0 for every u GV. 

Give an example of an operator T on a real space V for which (T(u), u) = 0 for 
every «eF but T ¥^ 0. 

(i) Set d = r(tt). Then (T^u), T(u)) = 0 and hence T(u) = 0, for every u G V. Accordingly, 

r = o. 
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(ii) By hypothesis, (T(v + w), v + w) - 0 for any v.wGV. Expanding and setting (T(v),v) = 0 
and (T(w), w) = 0, 

(T(v), w) + (T(w), v) = 0 (1) 

Note w is arbitrary in (1). Substituting iw for w, and using (T(v), iw) = i(T(v), w) = 
-i(T(v),w) and (T{iw),v) - (iT(w),v) = i(T(w),v), 

-i(T(v), w) + i(T(w), v) = 0 
Dividing through by i and adding to (1), we obtain (T(w), v) - 0 for any v, w G V. By (i), 

(iii) By (ii), the result holds for the complex ease; hence we need only consider the real case. 
Expanding {T(v + w), v + w) = 0, we again obtain (1). Since T is self-adjoint and since it is 
a real space, we have (T(w),v) = (w,T(v)) = (T(v),w). Substituting this into (1), we obtain 
(T(v), w) = 0 for any v, w G V. By (i), T = 0. 

For our example, consider the linear operator T on R 2 denned by T(x, y) = (y, -x). Then 
<T(w), u) - 0 for every u G V, but T ¥> 0. 



ORTHOGONAL AND UNITARY OPERATORS AND MATRICES 

13.23. Prove Theorem 13.9: The following conditions on an operator U are equivalent: 

(i) U* = U~ 1 ; (ii) (U{v),U(w)) = (v,w), for every v,wGV; (Hi) \\V(v)\\ = \\v\\, for 

every v &V. 

Suppose (i) holds. Then, for every v, w G V, 

(U(v),U(w)) = (,v,U*U(w)) = {v,I(w)) = (v,w) 
Thus (i) implies (ii). Now if (ii) holds, then 

\\U(v)\\ = y/(U(v), U(v)) = V(v^) = IM| 
Hence (ii) implies (iii). It remains to show that (iii) implies (i). 
Suppose (iii) holds. Then for every v G V, 

(U*U(v),v) = (U{v), U(v)) = (v,v) = (I(v), v) 

Hence ((U*V - I)(v),v) = 0 for every v G V. But V* U -I is self -adjoint (Prove!); then by Prob- 
lem 13.22 we have V*U - I - 0 and so U*U = I. Thus U* = t7~i as claimed. 



13.24. Let U be a unitary (orthogonal) operator on V, and let W be a subspace invariant 
under U. Show that W x is also invariant under U. 

Since U is nonsingular, U(W) — W; that is, for any w G W there exists w' G W such that 
U(w')-w. Now let ■uGW~ L . Then for any w G W, 

(U(v), w) = (U(v), U(w')) = (v, w') = 0 
Thus U(v) belongs to W 1 - . Therefore W 1 is invariant under 17. 



13.25. Let A be a matrix with rows R t and columns d. Show that: (i) the i;*-entry of A A* 
is {R it Rj); (ii) the tf-entry of A* A is <Cj, d). 

If A = (ay), then A* = (6 4j ) where 6 {j = Thus i4A* = (c y ) where 

n n 
k — 1 fc — 1 

= <(a a , .... a jn ), (a Jlt . . . , a jn )> = (R u Rj) 
as required. Also, A* A = (d i3 ) where 

n n 

d ij = 2 6 ifc«(cj = 2 «kj»iti = aij^li + <h.fvi + ■ ■ ■ + O-nf^i 
fc = 1 fc=l 

= <(« lj( . . . , a nj ), .... a Bi )> = (C j( Cj> 
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13.26. Prove Theorem 13.11: The following conditions for a matrix A are equivalent: 
(i) A is unitary (orthogonal), (ii) The rows of A form an orthonormal set. (iii) The 
columns of A form an orthonormal set. 

Let R t and C 4 denote the rows and columns of A, respectively. By the preceding problem, 
A A* = (c y ) where c y = (R J ,« J >. Thus AA* - 1 if and only if {R b R } ) = i i} . That is, (i) is 
equivalent to (ii). 

Also, by the preceding problem, A* A = (dy) where d tj - <C j( C { >. Thus A* A = I if and only 
if (C jt C { ) = S i} . That is, (i) is equivalent to (iii). 

Remark: Since (ii) and (iii) are equivalent, A is unitary (orthogonal) if and only if the transpose 
of A is unitary (orthogonal). 

13.27. Find an orthogonal matrix A whose first row is Ui = (1/3, 2/3, 2/3). 

First find a nonzero vector w 2 — (x,y,z) which is orthogonal to u,, i.e. for which 
0 = <iti, w 2 ) = x/Z + 2j//3 + 2z/3 = 0 or x + 2y + 2z = 0 
One such solution is w 2 — (0, 1, —1). Normalize w i to obtain the second row of A, i.e. 
m 2 = (0,lA/2,-l/\/2). 

Next find a nonzero vector w s = (x, y, z) which is orthogonal to both % and u 2 , i.e. for which 
0 = w 3 ) = xlZ + 2y/3 + 2z/3 =0 or x + 2y + 2z = 0 
0 = (u 2 , w 3 ) = y/y/2, - z/V% = 0 or y - z = 0 

Set « = — 1 and find the solution w 3 = (4, —1, —1). Normalize w s and obtain the third row of A, 
i.e. tig = (4A/18, -1A/18, -l/\/l8). Thus 

( 1/3 2/3 2/3 \ 

0 l/>/2 -1A/2 
4/3\/2 -l/3\/2 -I/3V2/ 
We emphasize that the above matrix A is not unique. 

13.28. Prove Theorem 13.12: Let {e x , . . . , e„} be an orthonormal basis of an inner product 
space V. Then the transition matrix from {ei} into another orthonormal basis is 
unitary (orthogonal). Conversely, if P = (a«) is a unitary (orthogonal) matrix, then 
the following is an orthonormal basis: 

{e{ = and + a,ue<i + • • • + a„ie n : i = 1, . . . , n} 

Suppose {/J is another orthonormal basis and suppose 

fi - 6 ii e i + &i2«2 + • • • + b in e n , i = l, ...,« (i) 

By Problem 13.15 and since {/j} is orthonormal, 

8y = </i,/;> = + bitb^ + ■■■ + b in b~ (2) 

Let B = (b(j) be the matrix of coefficients in (1). (Then B* is the transition matrix from {e { } to 
{fi}-) By Problem 13.25, BB* = (c ti ) where c tj = b n b^ + b i2 bj 2 + ■ ■ ■ + b in b~. By (2), e l} = 8„ 
and therefore BB* = /. Accordingly B, and hence B*, are unitary. 

It remains to prove that {e,'} is orthonormal. By Problem 13.15, 

(e' u e'j) = a u djj + a 2i a^ + • • • + a Bi a^ = <C i( C } ) 

where C { denotes the ith column of the unitary (orthogonal) matrix P = (a 0 ). By Theorem 13.11, 
the columns of P are orthonormal; hence (e[,e' s ) = (C^Cj) = S i} . Thus {e,'} is an orthonormal basis. 

13.29. Suppose A is orthogonal. Show that det(A) = 1 or —1. 

Since A is orthogonal, A A* = /. Using |A| = \A*\, 

1 = |/| = |AA*| = |A1|A'| = |Ap 

Therefore |A) = 1 or —1. 
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13.30. Show that every 2 by 2 orthogonal matrix A for which det(A) = 1 is of the form 

/ cos 9 - sin 9 \ 

. „ „ for some real number 9. 

\ sin 9 cos 9 J 

(a b\ 

Suppose A — I 1 . Since A is orthogonal, its rows form an orthonormal set; hence 



k c d y 

a 2 + b 2 = 1, c 2 + d 2 = 1, ac + bd = 0, ad - be = 1 
The last equation follows from det(A) = 1. We consider separately the cases a = 0 and a ¥* 0. 

If a = 0, the first equation gives b 2 = 1 and therefore b = ±1. Then the fourth equation 
gives c = —b — +1, and the second equation yields 1 + d 2 — 1 or d = 0. Thus 



A = 



/0 1 

V-i 0 



0 -1 

1 0 



The first alternate has the required form with e = -jr/2, and the second alternate has the required 
form with s = v/2. 

If a 0, the third equation can be solved to give c = —bd/a. Substituting this into the 
second equation, 

b 2 (P/a 2 + d 2 = 1 or b 2 d 2 + a 2 d 2 = a 2 or (b 2 + a 2 )d 2 = a 2 or a 2 = d 2 

and therefore a = d or a — —d. If a = — d, then the third equation yields c = b and so the 
fourth equation gives —a 2 — c 2 = 1 which is impossible. Thus a = d. But then the third equa- 
tion gives b = — c and so 



A = 



/a — c 
\ c a 



Since a 2 + c 2 = 1, there is a real number $ such that a = cos 6, c — sin 0 and hence A has the 
required form in this case also. 



SYMMETRIC OPERATORS AND CANONICAL FORMS IN EUCLIDEAN SPACES 

13.31. Let T be a symmetric operator. Show that: (i) the characteristic polynomial A{t) of 
T is a product of linear polynomials (over R); (ii) T has a nonzero eigenvector; 
(iii) eigenvectors of T belonging to distinct eigenvalues are orthogonal. 

(i) Let A be a matrix representing T relative to an orthonormal basis of V; then A = A'. Let 
A(t) be the characteristic polynomial of A. Viewing A as a complex self-adjoint operator, A 
has only real eigenvalues by Theorem 13.8. Thus 

A(t) = (t-\ 1 )(t-\ 2 )---(t-\ n ) 

where the \ ( are all real. In other words, A(t) is a product of linear polynomials over B. 

(ii) By (i), T has at least one (real) eigenvalue. Hence T has a nonzero eigenvector. 

(iii) Suppose T(v) = \v and T(w) = fiw where \ ¥= p. We show that \(v, w) = n(v, w): 

\{v,w) = (\v,w) = (T(v),w) = (v,T{w)) = (v,nw) = n{v,w) 
But X ¥> ii\ hence (v, w) = 0 as claimed. 



13.32. Prove Theorem 13.14: Let T be a symmetric operator on a real inner product space 
V. Then there exists an orthonormal basis of V consisting of eigenvectors of T; 
that is, T can be represented by a diagonal matrix relative to an orthonormal basis. 

The proof is by induction on the dimension of V. If dim V = 1, the theorem trivially holds. 
Now suppose dim V = n > 1. By the preceding problem, there exists a nonzero eigenvector v i of 
T. Let W be the space spanned by v u and let u x be a unit vector in W, e.g. let % = i>i/||i>i||. 



CHAP. 13] 



INNER PRODUCT SPACES 



301 



Since v x is an eigenvector of T, the subspace W of V is invariant under T. By Problem 13.21, 
W ± is invariant under T* = T. Thus the restriction T of T to TP 1 is a symmetric operator. By 
Theorem 13.2, V=W®W X . Hence dimW J -=it-l since dim W = 1. By induction, there 
exists an orthonormal basis {tt 2 , . • . , u„} of TP 1 - consisting of eigenvectors of 2* and hence of T. But 
<it 1( ttj) = 0 for i = 2, . . . , n because u { G W 1 . Accordingly {w 1( u n } is an orthonormal set 

and consists of eigenvectors of T. Thus the theorem is proved. 



13.33. Let A = ( g j ) • Find a ( real ) orthogonal matrix P for which P*AP is diagonal. 



The characteristic polynomial A(£) of A is 



A(t) = |t/-A| = 



t - 1 -2 
-2 t - 1 



= i 2 - 2t - 3 = (t - 3)(t + 1) 



and thus the eigenvalues of A are 3 and —1. Substitute t = 3 into the matrix tl — A to obtain the 
corresponding homogeneous system of linear equations 

2x-2y = 0, -2x + 2y = 0 
A nonzero solution is v 1 — (1, 1). Normalize v t to find the unit solution m x = (1/V2~> l/y/2). 

Next substitute t = — 1 into the matrix tl — A to obtain the corresponding homogeneous system 
of linear equations 

-2x - 2y = 0, -2x - 2y = 0 
A nonzero solution is v 2 = (1. — !)• Normalize -u 2 to find the unit solution w 2 = (Vy/2, —\ly[2). 
Finally let P be the matrix whose columns are M t and m 2 respectively; then 



= ( lhf l lh f) and P<AP = (I °) 
\l/V2 -1/V2/ VO -1/ 



P 

As expected, the diagonal entries of P f AP are the eigenvalues of A. 



13.34. Let A = 



Find a (real) orthogonal matrix P for which P*AP is diagonal. 



2 11 
12 1 
112 

First find the characteristic polynomial A(t) of A 

t t-2 -1 -1 

A(t) = \tI-A\ = 



-1 
-1 



t - 2 -1 
-1 t - 2 



= (t-l)2(t-4) 



Substitute 



Thus the eigenvalues of A are 1 (with multiplicity two) and 4 (with multiplicity one), 
t = 1 into the matrix tl — A to obtain the corresponding homogeneous system 

—x — y — z = 0, — a; — 3/ — z = 0, — x — y — z = 0 

That is, ac + 1/ + z = 0. The system has two independent solutions. One such solution is v t = 
(1,-1,0). We seek a second solution v 2 = (a,b,c) which is also orthogonal to v^, that is, such that 

a + b + c = 0 and also a — b = 0 

For example, v 2 = (1, 1, —2). Next we normalize v x and v 2 to obtain the unit orthogonal solutions 

Ul = (1/V2, -1/V2, 0), « 2 = (1/V6, 1/V6, -2A/6) 

Now substitute t = 4 into the matrix t/ — A to find the corresponding homogeneous system 

2x — y — z — 0, -x + 2y — z = 0, —x - y + 2z = 0 

Find a nonzero solution such as v 3 = (1, 1, 1), and normalize v 3 to obtain the unit solution 
u 3 = Vy/s, l/\/3). Finally, if P is the matrix whose columns are the m 4 respectively, 
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P = 



I 1/V2 i/Ve i/V3\ 

-1/V2 1/V6 1/V3 and P f AP 
\ 0 -2/V6 IA/3/ 




13.35. Find an orthogonal change of coordinates which diagonalizes the real quadratic form 
q{x,y) = 2x 2 + 2xy + 2y 2 . 

First find the symmetric matrix A representing q and then its characteristic polynomial A(t): 



A = 



2 1 
1 2 



and A(t) = \tI-A\ = 



t - 2 -1 
-1 t - 2 



(t-l)(t-3) 



The eigenvalues of A are 1 and 3; hence the diagonal form of q is 

q(x', y') = x' 2 + 3y' 2 

We find the corresponding transformation of coordinates by obtaining a corresponding orthonormal 
set of eigenvectors of A. 

Set t = 1 into the matrix tl — A to obtain the corresponding homogeneous system 

— x — y = 0, — x — y — 0 

A nonzero solution is t> x = (1,-1). Now set t = 3 into the matrix tl — A to find the corresponding 
homogeneous system 

x — y — 0, —x + y = 0 

A nonzero solution is v 2 = (1, 1). As expected by Problem 13.31, i>j and v 2 are orthogonal. Normalize 
v x and v 2 to obtain the orthonormal basis 

{«! = (1/V2, -1A/2), M 2 = (1A/2, l/VI)} 

The transition matrix P and the required transformation of coordinates follow: 

1/V2 W2~\ , fx\ „/*'\ x — (x' + v')/V2 



-1/-/2 1/V2 



and 



(;)-'(» 



y = (-x' + y')/y/2 



Note that the columns of P are and w 2 . We can also express and y' in terms of a; and y by 
using P" 1 = P ( ; that is, 

= (x-2/)A/2, j/' = (» + i/)/V2 

13.36. Prove Theorem 13.15: Let T be an orthogonal operator on a real inner product space 
V. Then there is an orthonormal basis with respect to which T has the following 
form: 



! -1 



-1 



-1 



T 
I 

I sin 61 



cos 0i 



— sin 0i 

cos 6 



h 



cos 8 r — sin Or 

COS Or 



j sin 0 r 
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Let S = T + T-i = T + T*. Then S* = (T + T*)* = T* + T = S. Thus S is a symmetric 
operator on V. By Theorem 13.14, there exists an orthonormal basis of V consisting of eigenvectors 
of S. If Xi, . . . , X TO denote the distinct eigenvalues of S, then V can be decomposed into the direct 
sum V — V l © V 2 © • • • © V m where the V s consists of the eigenvectors of S belonging to X f . We 
claim that each V { is invariant under T. For suppose v e Vj-, then S(v) — \v and 

S(T(v)) = (T+T~i)T(v) = T(T+T-*)(v) = TS(v) = T{\ t v) = \ t T(v) 

That is, T(v) & V t . Hence y { is invariant under T. Since the V t are orthogonal to each other, we 
can restrict our investigation to the way that T acts on each individual V ( . 

On a given V lt {T + T~i)v = S(v) = \v. Multiplying by T, 

(T*-\T + I)(v) = 0 

We consider the cases X f = ±2 and X ; ±2 separately. If Xj = ±2, then (T ± I) 2 (v) = 0 which 
leads to (T ± I)(v) = 0 or T(v) = ±v. Thus T restricted to this V t is either I or -/. 

If Xj ¥• ±2, then T has no eigenvectors in V, since by Theorem 13.8 the only eigenvalues of T 
are 1 or —1. Accordingly, for v ¥= 0 the vectors v and T(v) are linearly independent. Let W be 
the subspace spanned by v and T(v). Then W is invariant under T, since 

r(2») = T*(v) = \tT(v) - v 

By Theorem 13.2, V; = W © W X . Furthermore, by Problem 13.24 W 1 is also invariant under T. 
Thus we can decompose V f into the direct sum of two dimensional subspaces Wj where the Wj are 
orthogonal to each other and each Wj is invariant under T. Thus we can now restrict our investiga- 
tion to the way T acts on each individual Wj. 

Since T 2 — X f T + 1 = 0, the characteristic polynomial A(t) of T acting on Wj is A(i) = 
t 2 — \t + 1. Thus the determinant of T is 1, the constant term in A(t). By Problem 13.30, the 
matrix A representing T acting on Wj relative to any orthonormal basis of Wj must be of the form 

(cos e — sin e \ 
sin S cos $ J 

The union of the basis of the Wj gives an orthonormal basis of V ( , and the union of the basis of the 
V { gives an orthonormal basis of V in which the matrix representing T is of the desired form. 



NORMAL OPERATORS AND CANONICAL FORMS IN UNITARY SPACES 
13.37. Determine which matrix is normal: (i) A = ^ * * ^ , (ii) B = ^ j 2 + ij 

«> - = (J ;)Q :h_: ;) - = g x ;)=(-.• J) 

Since AA* ^ A*A, the matrix A is not normal. 

« --g.:*x-4 .ii) =( 2 - 2i 2 n 

B . B = / i i yi M = / 2 2 + 2 ^ 

\-i 2 — i,/ \1 2 + i/ \2-2i 6 / 
Since BB* = B*B, the matrix B is normal. 



13.38. Let T be a normal operator. Prove: 

(i) T(v) = 0 if and only if T*(«) = 0. 

(ii) T — A/ is normal. 

(iii) If T(v) = Av, then T*^) = \v; hence any eigenvector of T is also an eigen- 
vector of T*. 

(iv) If T(v) = \iv and T(w) = X 2 w where At A 2 , then (v,w) = 0; that is, eigen- 
vectors of T belonging to distinct eigenvalues are orthonormal. 
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(i) We show that (T(v), T(v)) = {T*(v), T*(v)y. 

(T(v), T(v)) = (v, T*T(v)) = (v, TT*(y)) - (T*(v), T*(v)) 
Hence by [7 3 ], T(v) = 0 if and only if T*(v) = 0. 

(ii) We show that T — \I commutes with its adjoint: 

(T - \I)(T - = (T-\I)(T* - TT* — XT* — XT + XX/ 

= T*T - \T - \T* + XX/ = (T* -\I)(T-\I) 
= (T-\I)*(T-\I) 

Thus T — \I is normal. 

(iii) If T(v) = \v, then (T - \I)(v) = 0. Now T — \I is normal by (ii); therefore, by (i), 
(T-\I)*{v) - 0. That is, (T*-\l)(v) = 0; hence T*(v) =\v. 

(iv) We show that X^v, w) = \ 2 (v, w): 

\i(v,w) = (\,y,w) = (T(v),w) = {v, T*(w)) = (v,\ 2 w) = \ 2 (v,w) 
But Xj ¥* X 2 ; hence (v, w) = 0. 

13.39. Prove Theorem 13.16: Let T be a normal operator on a complex finite dimensional 
inner product space V. Then there exists an orthonormal basis of V consisting of 
eigenvectors of T; that is, T can be represented by a diagonal matrix relative to an 
orthonormal basis. 

The proof is by induction on the dimension of V. If dim V = 1, then the theorem trivially 
holds. Now suppose dim V = n > 1. Since V is a complex vector space, T has at least one eigen- 
value and hence a nonzero eigenvector v. Let W be the subspace of V spanned by v and let be a 
unit vector in W. 

Since v is an eigenvector of T, the subspace W is invariant under T. However, v is also an 
eigenvector of T* by the preceding problem; hence W is also invariant under T*. By Problem 13.21, 
W" 1 " is invariant under T** = T. The remainder of the proof is identical with the latter part of 
the proof of Theorem 13.14 (Problem 13.32). 



13.40. Prove Theorem 13.17: Let T be an arbitrary operator on a complex finite dimensional 
inner product space V. Then T can be represented by a triangular matrix relative 
to an orthonormal basis {u u u 2 , . . ., tin) ; that is, for i = 1, . . . , n, 

T(ui) — Oi\U\ + ai2«2 + • • • + aim 

The proof is by induction on the dimension of V. If dim V — 1, then the theorem trivially 
holds. Now suppose dim V = n > 1. Since V is a complex vector space, T has at least one eigen- 
value and hence at least one nonzero eigenvector v. Let W be the subspace of V spanned by v and 
let u t be a unit vector in W. Then w, is an eigenvector of T and, say, = a u Wi. 

By Theorem 13.2, V = W © W ± . Let E denote the orthogonal projection of V into W 1 - . 
Clearly is invariant under the operator ET. By induction, there exists an orthonormal basis 
{u 2 , . . .,u n } of W' 1 such that, for i = 2, ,n, 

ET(Ui) = a i2 u 2 + a^Us + • • • + a ti u { 

(Note that {u lt u 2 , ...,u n } is an orthonormal basis of V.) But E is the orthogonal projection of V 
onto W 1 ' ; hence we must have 

T(u ( ) = a n Ui + a i2 u 2 + • • • + a u Ui 
for i = 2, . . . , n. This with T(u t ) = a n u, gives us the desired result. 
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MISCELLANEOUS PROBLEMS 

13.41. Prove Theorem 13.13A: The following conditions on an operator P are equivalent: 

(i) P = T 2 for some self -ad joint operator T. 

(ii) P — S*S for some operator S. 

(iii) P is self -adjoint and (P(u), — 0 for every u G V. 

Suppose (i) holds, that is, P = T* where T = T*. Then P = TT = T*T and so (i) implies 

(ii) . Now suppose (ii) holds. Then P* = (S*S)* = S*S** = S*S = P and so P is self-adjoint. 
Furthermore, 

(P(u),u) = (S*S(u),u) - (S(u),S(u)> ^ 0 
Thus (ii) implies (iii), and so it remains to prove that (iii) implies (i). 

Now suppose (iii) holds. Since P is self-adjoint, there exists an orthonormal basis {%, . . . , u n } 
of V consisting of eigenvectors of P; say, P(u t ) — XjWj. By Theorem 13.8, the Xj are real. Using 

(iii) , we show that the \ are nonnegative. We have, for each i, 

0 - <P(«i), Mj) = (Xjitj, = X^Wj, Mj> 

Thus <«i, Mj> - 0 forces X f - 0, as claimed. Accordingly, V^i is a real number. Let T be the 
linear operator defined by 

T(u { ) = VX; tt i( for i = 1, . . . , n 

Since T is represented by a real diagonal matrix relative to the orthonormal basis {u^, T is self- 
adjoint. Moreover, for each i, 

T^Ui) = TXvTiMj) = y/\iT( Ui ) = vTiVTiM; = \ iUi = P(w ( ) 

Since T 2 and P agree on a basis of V, P = T 2 . Thus the theorem is proved. 

Remark: The above operator T is the unique positive operator such that P = T 2 (Problem 
13.93); it is called the positive square root of P. 

13.42. Show that any operator T is the sum of a self-adjoint operator and skew-adjoint 
operator. 

Set S = $(T + T*) and 17 = %(T - T*). Then T — S + U where 

S* = (^(F+r*))* = ${T* + T**) = l(T* + T) = S 
and U* = ($(T-T*))* = $(T*-T) = -%(T - T*) = -U 

i.e. S is self-adjoint and U is skew adjoint. 

13.43. Prove: Let T be an arbitrary linear operator on a finite dimensional inner product 
space V. Then T is a product of a unitary (orthogonal) operator U and a unique 
positive operator P, that is, T = VP. Furthermore, if T is invertible, then U is also 
uniquely determined. 

By Theorem 13.13, T*T is a positive operator and hence there exists a (unique) positive operator 
P such that P 2 = T*T (Problem 13.93). Observe that 

||P(«)|| 2 = {P(v),P(v)) = {P*(v),v) = (T*T(v),v) = (T(v),T{v)) = \\T(v)\\* (1) 

We now consider separately the cases when T is invertible and non-invertible. 

If T is invertible, then we set U = PT~K We show that U is unitary: 

u* - (pr-i)* = r-»*p* = (r*)-ip and u*u = (r*)-»ppr-i = (t*)-^t*tt-^ = i 

Thus U is unitary. We next set U=U~K Then U is also unitary and T = UP as required. 

To prove uniqueness, we assume T = U 0 P 0 where I7 0 is unitary and P 0 is positive. Then 

T*T = PtutU 0 P 0 = P 0 IP 0 = Pi 

But the positive square root of T*T is unique (Problem 13.93); hence P„ = P. (Note that the 
invertibility of T is not used to prove the uniqueness of P.) Now if T is invertible, then P is also 
by (1). Multiplying U 0 P = UP on the right by P"i yields U 0 = U. Thus U is also unique when 
T is invertible. 
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Now suppose T is not invertible. Let W be the image of P, i.e. W = Im P. We define 
I/j : W -» V by 

= where P(v) = w (2) 

We must show that U t is well defined, that is, that P(v) = P(v') implies T(v) = T(v'). This follows 
from the fact that P(v — v') — 0 is equivalent to \\P(v — v')\\ = 0 which forces [\T(v — v')\\ = 0 
by (1). Thus U l is well defined. We next define U 2 : W -> V. Note by (J) that P and T have the 
same kernels. Hence the images of P and T have the same dimension, i.e. dim (Im P) = dim W = 
dim(ImT). Consequently, W X and (Im T) 1 also have the same dimension. We let U 2 be any 
isomorphism between and (Im T) 1 . 

We next set E7 = J/j © £/ 2 . (Here E7 is defined as follows: if v G V and v = w + w' where 

weH'.w'eff 1 , then C7(v) = U x (w) + U 2 (w').) Now U is linear (Problem 13.121) and, if v G V 
and P(v) = m>, then by (2) 

T(v) = U t (w) = E7(w) = l/P(v) 

Thus T = UP as required. 

It remains to show that U is unitary. Now every vector x G V can be written in the form 
x = P(v) + w' where tt/GPP- 1 . Then U(x) = VP(v) + U 2 (w') = Z» + U 2 (w') where <IT(i>), 
I7 2 (w')> = 0 by definition of I7 2 . Also, <r(i>), r(t>)> = (P(v),P(v)) by (J). Thus 

<i/(*),u(x)> = <r(v) + u 2 ( W '),r(v) + i; 2 (w')> 

- <r(D), r<«)> + <u 2 (w'), t/ 2 ( W ')> 

= <P(V), P(t))> + <tt>', W') = (P(v) + W', P(V) + W') 

- <*, X) 

(We also used the fact that (P(t>), w') = 0.) Thus U is unitary and the theorem is proved. 



13.44. Let (ai, a 2 , . . . ) and (6i, b 2 , . . . ) be any pair of points in k-space of Example 13.5. 

00 

Show that the sum 2 Oi&i = ai&i + 0262 + • • • converges absolutely. 
By Problem 1.16 (Cauchy-Schwarz inequality), 



Mil + ••• + |«„6„| 




which holds for every n. Thus the (monotonic) sequence of sums S n ■ |a 1 6 1 | + • • • + |a„6 B | is 
bounded, and therefore converges. Hence the infinite sum converges absolutely. 



13.45. Let V be the vector space of polynomials over R with inner product defined by 

</» 3) — ( f{t) git) dt. Give an example of a linear functional <j> on V for which 
Jo 

Theorem 13.5 does not hold, i.e. there does not exist a polynomial h(t) for which 

<*>(/) = </» h) for every fGV. 

Let <p : V -» R be defined by <t>(f) = /(0), that is, i> evaluates f(t) at 0 and hence maps f(t) into 
its constant term. Suppose a polynomial h(t) exists for which 



*(/) = /(0) - f l /(t)A(t)dt 



J* 



0 

for every polynomial /(*). Observe that <p maps the polynomial tf(t) into 0; hence by (1), 

tf(t)h(t)dt = 0 (2) 

'0 

for every polynomial f(t). In particular, (2) must hold for /(t) = th(t), that is, 

f 1 &hHt) dt = 0 
•A) 

This integral forces to be the zero polynomial; hence </>{f) = (f,h) = </,0) = 0 for every poly- 
nomial f(t). This contradicts the fact that 0 is not the zero functional; hence the polynomial h(t) 
does not exist. 
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Supplementary Problems 



INNER PRODUCTS 
13.46. Verify that 




13.47. Let u = {x x , x 2 ) and v = (y u y 2 ) belong to R 2 . 

(i) Verify that the following is an inner product on R 2 : 

f(u, v) = x t y x - 2x x y 2 - 2x 2 y x + 5x 2 y 2 

(ii) For what values of k is the following an inner product on R 2 ? 

/(it, v) = x x y x - 3x x y 2 - 3a; 2 2/i + kx 2 y 2 

(iii) For what values of a, b,e,d G R is the following an inner product on R 2 ? 

f(u, v) - ax x y x + bx x y 2 + cx 2 y x + dx 2 y 2 

13.48. Find the norm of v = (1, 2) G R 2 with respect to (i) the usual inner product, (ii) the inner 
product in Problem 13.47(i). 

13.49. Let w = (z x , z 2 ) and v = {w x , w 2 ) belong to C 2 . 

(i) Verify that the following is an inner product on C 2 : 

f(u, v) — z l w 1 + (1 + i)z x w 2 + (1 — i)2 2 w 1 + 3z 2 w 2 

(ii) For what values of a, b, c,d G C is the following an inner product on C 2 ? 

f(u, v) = az 1 w 1 + bz 1 iv 2 + cz 2 w 1 + dz 2 w 2 

13.50. Find the norm of v = (1 — 2i, 2 + Si) G C 2 with respect to (i) the usual inner product, (ii) the 
inner product in Problem 13.49(1). 

13.51. Show that the distance function d{u, v) = \\v — u\\, where u, v G V, satisfies the following axiom 
of a metric space: 

[Z>j] d(u, v) — 0; and d(u, v) = 0 if and only if u = v. 

[D 2 ] d(u, v) = d{v, u). 

[D 3 ] d{u, v) — d(u, w) + d(w, v). 

13.52. Verify the Parallelogram Law: ||m + »|| + = 2||w|| + 2||w||. 

13.53. Verify the following polar forms for (u, v): 

(i) (u,v) = l\\u + v\\^-l\\u-v\\^ (real case); 

(ii) (u,v) = %]\u + v\] 2 - %]\u-v\\2 +i\\u + iv\\z -i\\u-iv\\* (complex case). 

13.54. Let V be the vector space of m X n matrices over R. Show that (A,B) = tr(B t A) defines an inner 
product in V. 



13.55. Let V be the vector space of polynomials over R. Show that (f,g) = I f{t) g(t) dt defines an 



inner product in V. 




13.56. 



Find the norm of each of the following vectors: 

(i) u = (i,-i,i,i)GR* 

(ii) v = (1 - 2i, 3 + i, 2 - 5i) G C 3 , 

(iii) /(f) = t 2 - 2t + 3 in the space of Problem 13.55, 
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13.57. Show that: (i) the sum of two inner products is an inner product; (ii) a positive multiple of an 
inner product is an inner product. 

13.58. Let a, b, c G R be such that at 2 + bt + c - 0 for every J6R. Show that b 2 - Aac ^ 0. Use this 
result to prove the Cauchy-Schwarz inequality for real inner product spaces by expanding 

||tw + i;|P o. 

13.59. Suppose \{u, v)\ = \\u\\ \\v\\. (That is, the Cauchy-Schwarz inequality reduces to an equality.) Show 
that u and v are linearly independent. 

13.60. Find the cosine of the angle 6 between u and v if: 

(i) w = (l,-3,2), v = (2,1,5) inR3; 

(ii) u = 2t- 1, v = t 2 in the space of Problem 13.55; 



(iii) u = 




in the space of Problem 13.54. 



ORTHOGONALITY 

13.61. Find a basis of the subspace W of R 4 orthogonal to m x = (1, -2, 3, 4) and u 2 = (3, —5, 7, 8). 

13.62. Find an orthonormal basis for the subspace W of C 3 spanned by % = (1, i, 1) and % = (1 + i, 0, 2). 

13.63. Let V be the vector space of polynomials over R of degree — 2 with inner product (f, g) = 

C f(t)g(t)dt. 
•'o 

(i) Find a basis of the subspace W orthogonal to h(t) = 24 + 1. 

(ii) Apply the Gram-Schmidt orthogonalization process to the basis {1, t 2 } to obtain an ortho- 
normal basis {wi(i), u 2 (t), u 3 (t)} of V. 

13.64. Let V be the vector space of 2 X 2 matrices over R with inner product denned by (A,B) = tr(B'A). 

(i) Show that the following is an orthonormal basis of V: 

{(o o)' (o o)' (o I)' (o l)} 

(ii) Find a basis for the orthogonal complement of (a) the diagonal matrices, (6) the symmetric 
matrices. 

13.65. Let W be a subset (not necessarily subspace) of V. Prove: (i) = L(W); (ii) if V has finite 
dimension, then W X1 ~ — L(W). (Here L(W) is the space spanned by W.) 

13.66. Let W be the subspace spanned by a nonzero vector w in V, and let E be the orthogonal projection 

of V onto W. Prove E(v) = ' .. w. We call E(v) the projection of v along w. 

IMr 

13.67. Find the projection of v along w if: 

(i) v = (1, -1, 2), w = (0, 1, 1) in VL 3 ; 

(ii) v = (1 - i, 2 + 3i), w = (2 - i, 3) in C 2 ; 

(iii) v = 2t — 1, w = t 2 in the space of Problem 13.55; 



(iv) v 




in the space of Problem 13.54. 



13.68. 



Suppose {u u . . .,u r ) is a basis of a subspace W of V where dim V = n. Let {v lt . . .,i; n _ r } be an 
independent set of n — r vectors such that <u t , Vj) = 0 for each i and each j. Show that 
{v v . . ., v„_ r } is a basis of the orthogonal complement W . 
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13.69. Suppose {w 1( . . . , u r ] is an orthonormal basis for a subspace W of V. Let E : V -» V be the linear 
mapping defined by „, , , , , 

E(V) = (V, + (V, U 2 )U 2 +•••+<!», M r )M r 

Show that 2? is the orthogonal projection of V onto W. 

r 

13.70. Let {i^ M r } be an orthonormal subset of V. Show that, for any v <E V, 2 K»,«<>l 2 - I Ml 2 - 

(This is known as Bessel's inequality.) 

13.71. Let V be a real inner product space. Show that: 

(i) IM| = |M| if and only if (u + v, u — v) = 0; 

(ii) ||«* + v\\ 2 = ||u|| 2 + IMI 2 if and only if <«,«> = 0. 

Show by counterexamples that the above statements are not true for, say, C 2 . 

13.72. Let U and W be subspaces of a finite dimensional inner product space V. Show that: (i) (U + W) X = 
U 1 - n PF X ; (ii) (17 n W) x = U 1 + W 1 . 



ADJOINT OPERATOR 

13.73. Let T : R 3 -» R 3 be defined by T(x, y, z) = (x + 2y, %x - 4s, 2/)- Find T* (x, y, z). 

13.74. Let T : C 3 -» C 3 be defined by 

r(x, j/, z) = {ix + (2 + 3t)y, 3x + (3 - (2 - U)y + iz) 

Find T*(x,y,z). 

13.75. For each of the following linear functions <f> on V find a vector it£V such that <6(i>) = (v, u) for 
every 

(i) 0 : R 3 -* R defined by <p(x, y,z) = « + 2j/ - 3z. 

(ii) 0 : C 3 -» C defined by 0(«, j/, 2) = mb + (2 + 3i)y + (1 - 2t>. 

(iii) 0 : V -* R defined by <f>(f) = /(l) where V is the vector space of Problem 13.63. 

13.76. Suppose V has finite dimension. Prove that the image of T* is the orthogonal complement of the 
kernel of T, i.e. Im T* = (Ker T) L . Hence rank(T) = rank(r*). 

13.77. Show that T*T = 0 implies T = 0. 

13.78. Let V be the vector space of polynomials over R with inner product defined by (/, g) — I f(t) g(t) dt. 

Jo 

Let D be the derivative operator on V, i.e. D(f) = df/dt. Show that there is no operator D* on V 
such that (D(f),g) - (f,D*(g)) for every /,jeV. That is, D has no adjoint. 



UNITARY AND ORTHOGONAL OPERATORS AND MATRICES 

13.79. Find an orthogonal matrix whose first row is: (i) (1/V5, 2/V5); (ii) a multiple of (1,1,1). 

13.80. Find a symmetric orthogonal matrix whose first row is (1/3,2/3,2/3). (Compare with Problem 
13.27.) 

13.81. Find a unitary matrix whose first row is: (i) a multiple of (1, 1 — i); (ii) (^, ^ — Ji) 

13.82. Prove: The product and inverses of orthogonal matrices are orthogonal. (Thus the orthogonal 
matrices form a group under multiplication called the orthogonal group.) 

13.83. Prove: The product and inverses of unitary matrices are unitary. (Thus the unitary matrices 
form a group under multiplication called the unitary group.) 

13.84. Show that if an orthogonal (unitary) matrix is triangular, then it is diagonal. 
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13.85. Recall that the complex matrices A and B are unitarily equivalent if there exists a unitary matrix 
P such that B = P*AP. Show that this relation is an equivalence relation. 

13.86. Recall that the real matrices A and B are orthogonally equivalent if there exists an orthogonal 
matrix P such that B — P t AP. Show that this relation is an equivalence relation. 

13.87. Let W be a subspace of V. For any v&V let v = w + w' where w S W, w' G W X . (Such a sum 
is unique because V = W © W X .) Let T.V^V be denned by T(v)=w-w'. Show that T is 
a self-adjoint unitary operator on V. 

13.88. Let V be an inner product space, and suppose U : V -» V (not necessarily linear) is surjective (onto) 
and preserves inner products, i.e. (U(v),U(w)) = (u,w) for every v,we.V. Prove that U is 
linear and hence unitary. 



POSITIVE AND POSITIVE DEFINITE OPERATORS 

13.89. Show that the sum of two positive (positive definite) operators is positive (positive definite). 

13.90. Let T be a linear operator on V and let / : V X V -» K be defined by f(u, v) = (T(u), v). Show 
that / is itself an inner product on V if and only if T is positive definite. 

13.91. Suppose E is an orthogonal projection onto some subspace W of V. Prove that kl + E is positive 
(positive definite) if k — Q (fc > 0). 

13.92. Prove Theorem 13.13B, page 288, on positive definite operators. (The corresponding Theorem 
13.13A for positive operators is proved in Problem 13.41.) 

13.93. Consider the operator T defined by T(u { ) = VTjMj, i = 1, . . .,n, in the proof of Theorem 13.13A 
(Problem 13.41). Show that T is positive and that it is the only positive operator for which T 1 = P. 

13.94. Suppose P is both positive and unitary. Prove that P = /. 

13.95. An re X w (real or complex) matrix A = (a ;j ) is said to be positive if A viewed as a linear operator 
on if" is positive. (An analogous definition defines a positive definite matrix.) Prove A is positive 
(positive definite) if and only if a i} = a£ and 

n 

2 BuSi*! - o (>o) 

U = l 

for every (x lt . . . , x n ) in K n . 
13 96 Determine which of the following matrices are positive (positive definite): 

(ID (-:;) (.::) go (ID (i:) 

(i) (ii) (iii) (iv) (v) (vi) 

13.97. Prove that a 2X2 complex matrix A = ^ is positive if and only if (i) A = A*, and 
(ii) a, d and ad — be are nonnegative real numbers. 

13.98. Prove that a diagonal matrix A is positive (positive definite) if and only if every diagonal entry is 
a nonnegative (positive) real number. 

SELF-ADJOINT AND SYMMETRIC OPERATORS 

13.99. For any operator T, show that T + T* is self -adjoint and T - T* is skew-adjoint. 

13.100. Suppose T is self-adjoint. Show that T*{v) = 0 implies T(v) = 0. Use this to prove that 
T n (v) = 0 also implies T(v) = 0 for w > 0. 
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13.101. Let V be a complex inner product space. Suppose (T(v), v) is real for every v G V. Show that T 
is self-adjoint. 

13.102. Suppose S and T are self-adjoint. Show that ST is self-adjoint if and only if S and T commute, 
i.e. ST = TS. 

13.103. For each of the following symmetric matrices A, And an orthogonal matrix P for which P ( AP is 
diagonal: 



13.104. Find an orthogonal transformation of coordinates which diagonalizes each quadratic form: 
(i) q(x, y) = 2a; 2 - 6xy + 10y*, (ii) q(x, y) = x 2 + Sxy - 5j/2 

13.105. Find an orthogonal transformation of coordinates which diagonalizes the quadratic form 

q(x, y, z) = 2xy + 2xz + 2yz. 

NORMAL OPERATORS AND MATRICES 



13.106. Verify that A = 
find P*AP. 




is normal. Find a unitary matrix P such that P*AP is diagonal, and 



13.107. Show that a triangular matrix is normal if and only if it is diagonal. 

13.108. Prove that if T is normal on V, then ||!T(u)|| = ||r*(v)|| for every ti£V. Prove that the converse 
holds in complex inner product spaces. 

13.109. Show that self-adjoint, skew-adjoint and unitary (orthogonal) operators are normal. 

13.110. Suppose T is normal. Prove that: 

(i) T is self-adjoint if and only if its eigenvalues are real. 

(ii) T is unitary if and only if its eigenvalues have absolute value 1. 

(iii) T is positive if and only if its eigenvalues are nonnegative real numbers. 

13.111. Show that if T is normal, then T and T* have the same kernel and the same image. 

13.112. Suppose S and T are normal and commute. Show that S + T and ST are also normal. 

13.113. Suppose T is normal and commutes with S. Show that T also commutes with S*. 

13.114. Prove: Let S and T be normal operators on a complex finite dimensional vector space V. Then 
there exists an orthonormal basis of V consisting of eigenvectors of both S and T. (That is, S and 
T can be simultaneously diagonalized.) 

ISOMORPHISM PROBLEMS 

13.115. Let {e u . . . , e n } be an orthonormal basis of an inner product space V over K. Show that the map 
v H> [v] e is an (inner product space) isomorphism between V and X". (Here [v] e denotes the co- 
ordinate vector of v in the basis {e t }.) 

13.116. Show that inner product spaces V and W over K are isomorphic if and only if V and W have the 
same dimension. 



13.117. Suppose {e 1; . . . , e J and {e[, ...,e' n ) are orthonormal bases of V and W respectively. Let T : V -» W 
be the linear map defined by T(eJ - e[, for each i. Show that T is an isomorphism. 




312 



INNER PRODUCT SPACES 



[CHAP. 13 



13.118. Let V be an inner product space. Recall (page 283) that each u£V determines a linear functional 
u in the dual space V* by the definition u(v) = (v,u) for every v E.V. Show that the map 
u H> u is linear and nonsingular, and hence an isomorphism from V onto V*. 

13.119. Consider the inner product space V of Problem 13.54. Show that V is isomorphic to R m " under the 
mapping 

/ 011 a 12 • • • a ln \ 

h- > R 2 , ■ ■ ■ , R m ) 





/Oil 


a 12 . 


■ «ln 






<&22 


• «2n 




\«ml 


°m2 • 


• Ojnn 



where R t = (a il( o i2 , . . .,«;„), the ith row of A. 
MISCELLANEOUS PROBLEMS 

13.120. Show that there exists an orthonormal basis {u lt . . ,,mJ of V consisting of eigenvectors of T if and 
only if there exist orthogonal projections E 1 ,...,E r and scalars \j, ...,X r such that: (i) T = 
XiJ?! + • • • + X,J£ r ; (ii) Ei + ••• + E r = I; (iii) EiEj = 0 for i ¥> j. 

13.121. Suppose V = U (B W and suppose T t ;U-*V and T 2 : W -+ V are linear. Show that T = 
T 1 © T 2 is also linear. (Here T is defined as follows: if v G V and v — u + w where u G U, w G W, 
then T(v) = T x (u) + T 2 (w).) 

13.122. Suppose U is an orthogonal operator on B s with positive determinant. Show that U is either a 
rotation or a reflection through a plane. 



Answers to Supplementary Problems 

13.47. (ii) k > 9; (iii) a > 0, d > 0, ad - be > 0 

13.48. (i) VE, (ii) yn 
13.50. (i) 3\/2, (ii) &V2 

13.56. (i) ||u||=V66/12, (ii) \\v\\ = 2VU, (iii) ||/(t)|| = VWTB, (iv) ||A|| = V30 

13.60. (i) cos e = 9/V420, (ii) cos e = Vl5/6, (iii) cos e - 2/v / 210 

13.61. {v x = (1,2,1,0), v 2 = (4,4,0,1)} 

13.62. {v t = (1, i, \)lyfl, v 2 = (2i, 1 - Si, 3 - i)/ V24} 

13.63. (i) {A (t) = 7*2 - 5*, / 2 (t) = 12*2 _ 5} 

(ii) { M ,(t) = l, M2 (t) = (2f - 1)/V3, M 3 (*) = (6* 2 - 6< + 1)/V5 } 

»•«•<»> «•>{(: i)-c :)}■ «{(r«)} 

13.67. (i) (0, 1/V2, l/y/2), (ii) (26 + 7i, 27 + 24i)/Vl4, (iii) V5 t 2 /6, (iv) ^_ ?/ ^ ^J^j 

13.73. r*(«, 2/, z) = (» + 32/, 2* + z, -4y) 

13.74. r*(x, y, z) = (-ix + Sy, (2 - 3t> + (2 + 5i)z, (S + i)y-iz) 
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13.75. Let u = 4>(e 1 )e 1 + • • ■ + <f>(e n )e n where {e f } is an orthonormal basis. 

(i) u = (1, 2, -3), (ii) u = (-i, 2 - 3i, 1 + 2i), (iii) u = (18*2 - 8t + 13)/15 

'l/yfl 1/VI l/y/s\ 
13.79. (i) I " * " ) , (ii) I 0 1A/2 -1/V2 



\2/\/6 -1A/6 -1/V6/ 



1/3 2/3 2/3' 
13.se, [ 2/3 -2/3 1/3 
.2/3 1/3 -2/3/ 



JL 
2 



13.96. Only (i) and (v) are positive. Moreover, (v) is positive definite. 

(1 , p = I -•- = ; 2 A/S -W5\ ( . u) p / WIS -WIS 



\-Wb 2/V5/ \-i 



-l/VB 2/V5/ \-i/a/io 3/\/l0 

13.104. (i) * = (3*' - j/')A/lO, y = (x' + 3y')/y/T0, (ii) * = (2*' - ?')/V5, » = (»' + 2j/')/V5 

13.105. x = »7v^3 + y'/VI + z'lyfl, y = x'lyfl - y'lyfl + z'tyfl, z = x'lyfl-tz'lyfl 

/l/V2 -1/V2\ /2 + i 0 \ 

1S - ,M - ' " \vA WiJ" PMP = ( 0 -«) 
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Sets and Relations 

SETS, ELEMENTS 

Any well defined list or collection of objects is called a set; the objects comprising the 
set are called its elements or members. We write 

p G A if p is an element in the set A 

If every element of A also belongs to a set B, i.e. if x e A implies x G B, then A is called 
a subset of B or is said to be contained in B; this is denoted by 

A C B or B D A 

Two sets are egitai if they both contain the same elements; that is, 

A = B if and only if A C B and BcA 

The negations of p£A, AcB and A = B are written p & A, A<£B and A¥=B 
respectively. 

We specify a particular set by either listing its elements or by stating properties which 
characterize the elements in the set. For example, 

A = {1,3,5,7,9} 

means A is the set consisting of the numbers 1, 3, 5, 7 and 9; and 

B = {x : x is a prime number, x < 15} 

means that B is the set of prime numbers less than 15. We also use special symbols to 
denote sets which occur very often in the text. Unless otherwise specified: 

N = the set of positive integers: 1, 2, 3, ... ; 

Z = the set of integers: . . . , —2, —1, 0, 1, 2, ... ; 

Q = the set of rational numbers; 

K = the set of real numbers; 

C = the set of complex numbers. 

We also use 0 to denote the empty or null set, i.e. the set which contains no elements; this 
set is assumed to be a subset of every other set. 

Frequently the members of a set are sets themselves. For example, each line in a set 
of lines is a set of points. To help clarify these situations, we use the words class, collection 
and family synonymously with set. The words subclass, subcollection and subfamily have 
meanings analogous to subset. 

Example A.l : The sets A and B above can also be written as 

A = {a; G N : a; is odd, x < 10} and B = {2,3,5,7,11,13} 

Observe that 9 G A but 9 G B, and 11GB but 11 G A; whereas 3GA and 
3 GB, and 6 G A and 6 GB. 

Example A.2: The sets of numbers are related as follows: NcZcQcRcC. 

Example A .3: Let C = {x : x 2 = 4, x is odd}. Then C = 0, that is, C is the empty set. 

Example A.4: The members of the class {{2,3}, {2}, {5,6}} are the sets {2,3}, {2} and {5,6}. 
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The following theorem applies. 

Theorem A.l: Let A, B and C be any sets. Then: (i) Ac A; (ii) if Ac 5 and B (ZA, 
then A = B; and (iii) if A c B and BcC, then A c C. 

We emphasize that AcB does not exclude the possibility that A = B. However, if 
A C B but A¥*B, then we say that A is a proper subset of B. (Some authors use the 
symbol c for a subset and the symbol c only for a proper subset.) 

When we speak of an indexed set {a*: i G I}, or simply {cu}, we mean that there is a 
mapping <f> from the set / to a set A and that the image 4>{i) of i G J is denoted Oi. The set 
I is called the indexing set and the elements <u (the range of <£) are said to be indexed by I. 
A set {Oi, an, . . . } indexed by the positive integers N is called a sequence. An indexed class 
of sets {Ai : i G /}, or simply {Ai}, has an analogous meaning except that now the map 4> 
assigns to each i G / a set A 4 rather than an element a*. 

SET OPERATIONS 

Let A and B be arbitrary sets. The union of A and B, written AuB, is the set of 
elements belonging to A or to B; and the intersection of A and B, written AC\B, is the set 
of elements belonging to both A and B: 

AUB - {x : xeA or x G B} and A n Z? = {x: x G A and x G B) 

If A n Z? = 0, that is, if A and £ do not have any elements in common, then A and B are 
said to be disjoint. 

We assume that all our sets are subsets of a fixed universal set (denoted here by U). 
Then the complement of A, written A c , is the set of elements which do not belong to A: 

A° = {x G 17: a; G A} 

Example AJ: The following diagrams, called Venn diagrams, illustrate the above set operations. 

Here sets are represented by simple plane areas and U, the universal set, by the 
area in the entire rectangle. 




A c is shaded 
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Sets under the above operations satisfy various laws or identities which are listed in 
the table below. In fact, we state 

Theorem A.2: Sets satisfy the laws in Table 1. 



LAWS OF THE ALGEBRA OF SETS 


la. 


AuA = A 


Idempotent Laws 

lb. AnA = A 


2a. 


(AnB)uC = Au(BuC) 


Associative Laws 

2b. (AnB)nC = An(BnC) 


3a. 


AuB - BuA 


Commutative Laws 

3b. A nB = BnA 


4a. 


Distributive Laws 

Au(BnC) = (Aufi)n(AuC) 4b. An(BuC) = (AnB)u(AnC) 


5a. 
6a. 


Au0 = A 
AuU = U 


Identity Laws 

5b. AnU = A 
6b. An (3 = 0 


7a. 
8a. 


AuA* = U 

(Ac)e = A 


Complement Laws 

7b. A nA c = 0 
8b. U° = 0, 0c = U 


9a. 


(AuB) c = A c ni?« 


De Morgan's Laws 

9b. (AnJS) c = A c uB° 


Table 1 



Remark: Each of the above laws follows from an analogous logical law. For example, 

AnB = {x: xEA and x<ZB) = {x : x G B and x G A} = BnA 

(Here we use the fact that the composite statement "p and q", written p a q, is 
logically equivalent to the composite statement "q and p", i.e. q a p.) 

The relationship between set inclusion and the above set operations follows. 
Theorem A.3: Each of the following conditions is equivalent to AcB: 

(i) AnB = A (iii) B c cA c (v)BuA c =C7 

(ii) AUB = B (iv) AnB c = 0 

We generalize the above set operations as follows. Let {Ai : i E /} be any family of 
sets. Then the union of the A it written U iei A i (or simply U s Ai), is the set of elements 
each belonging to at least one of the Aij and the intersection of the A{, written n. ei A 4 or 
simply n j Ai, is the set of elements each belonging to every Ai. 

PRODUCT SETS 

Let A and B be two sets. The product set of A and B, denoted by A X B, consists of all 
ordered pairs (a, 6) where a e A and b GB: 

AxB = {{a, b) : a G A, b G B} 
The product of a set with itself, say A x A, is denoted by A 2 . 
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Example A.6: The reader is familiar with the cartesian plane R 2 — R X R as shown below. Here 
each point P represents an ordered pair (a, b) of real numbers, and vice versa. 



6 


•2 


,P 


-1 




— i i 1 

-S -2 -1 


1 2 0. 3 

-l 
-2 



Example A.7: Let A = {1,2,3} and B - {a,b}. Then 

A X B = {(1, a), (1, 6), (2, a), (2, b), (3, a), (3, b)} 

Remark: The ordered pair (a, b) is denned rigorously by (a, b) = {{a}, {a, &}}. From this 
definition, the "order" property may be proven; that is, (a, b) = (c, d) if and only 
if a = c and b = d. 

The concept of product set is extended to any finite number of sets in a natural way. 
The product set of the sets A u . . . , A m , written Ai x A 2 x • • • x A m , is the set consisting of 
all w-tuples (a u a 2 , . . .,Om) where a s e A t for each i. 

RELATIONS 

A binary relation or simply relation R from a set A to a set B assigns to each ordered 
pair (a, b) G A x B exactly one of the following statements: 

(i) "a is related to b", written aRb, 

(ii) "a is not related to b", written alfcb. 
A relation from a set A to the same set A is called a relation in A. 

Example A.8: Set inclusion is a relation in any class of sets. For, given any pair of sets A and B, 
either A cB or A<tB. 

A. 

Observe that any relation R from A to B uniquely defines a subset R of A x B as follows: 

R = {(a, b): aRb} 

Conversely, any subset R of A x B defines a relation from A to B as follows: 

aRb if and only if (a, b) S R 

In view of the above correspondence between relations from A to B and subsets of AxB, 
we redefine a relation as follows: 

Definition: A relation R from A to B is a subset of A x B. 

EQUIVALENCE RELATIONS 

A relation in a set A is called an equivalence relation if it satisfies the following axioms: 
[Ei] Every a € A is related to itself. 
[Ez] If a is related to b, then b is related to a. 
[Es] If a is related to b and & is related to c, then a is related to c. 

In general, a relation is said to be reflexive if it satisfies [Ei], symmetric if it satisfies [E z ], 
and transitive if it satisfies [2?s]. In other words, a relation is an equivalence relation if 
it is reflexive, symmetric and transitive. 



APPENDIX A] 



SETS AND RELATIONS 



319 



Example A.9: Consider the relation C of set inclusion. By Theorem A.l, Ac A for every set A; 

and if A c B and B c C, then AcC. That is, C is both reflexive and transitive. 
On the other hand, C is not symmetric, since A c B and A ¥= B implies B A. 

Example A.10: In Euclidean geometry, similarity of triangles is an equivalence relation. For if 
a, p and y are any triangles, then: (i) o is similar to itself; (ii) if a is similar to p, 
then p is similar to a; and (iii) if a is similar to p and p is similar to y, then a is 
similar to y. 

If R is an equivalence relation in A, then the equivalence class of any element a e A, 
denoted by [a], is the set of elements to which a is related: 

[a] = {x: aRx) 

The collection of equivalence classes, denoted by AIR, is called the quotient of A by R: 

AIR = {[a] : a S A} 
The fundamental property of equivalence relations follows: 

Theorem A.4: Let R be an equivalence relation in A. Then the quotient set AIR is a 
partition of A, i.e. each a e A belongs to a member of AIR, and the mem- 
bers of A/R are pairwise disjoint. 

Example A.11: Let R 5 be the relation in Z, the set of integers defined by 

x = y (mod 5) 

which reads "x is congruent to y modulo 5" and which means "x - y is divisible by 5". 
Then R s is an equivalence relation in Z. There are exactly five distinct equivalence 
classes in Z//Z s : 



A 0 = { 


..,-10, -5, 0, 5, 10} 


At = { 


...,-9, -4, 1, 6, 11} 


A 2 = { 


..,-8, -3, 2, 7,12} 


A 3 = { 


..,-7,-2, 3, 8, 13} 


A 4 = { 


..,-6, -1,4, 9, 14} 



Now each integer x is uniquely expressible in the form x = 5q + r where 0 — r < 5; 
observe that x G E r where r is the remainder. Note that the equivalence classes 
are pairwise disjoint and that Z = A 0 uA,uA 2 uA 3 uA 4 . 
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Algebraic Structures 

INTRODUCTION 

We define here algebraic structures which occur in almost all branches of mathematics. 
In particular we will define a field which appears in the definition of a vector space. We 
begin with the definition of a group, which is a relatively simple algebraic structure with 
only one operation and is used as a building block for many other algebraic systems. 

GROUPS 

Let G be a nonempty set with a binary operation, i.e. to each pair of elements a, b G G 
there is assigned an element ab G G. Then G is called a group if the following axioms hold: 

[Gi] For any a, b, c G G, we have (ab)c = a(bc) (the associative law). 

[G2] There exists an element e G G, called the identity element, such that ae = ea = a for 
every a G G. 

[Ga] For each aGG there exists an element cr 1 G G, called the inverse of a, such that 
ao -1 = a -1 a = e. 

A group G is said to be abelian (or: commutative) if the commutative law holds, i.e. if 
ab = ba for every a,b GG. 

When the binary operation is denoted by juxtaposition as above, the group G is said 
to be written multiplicatively. Sometimes, when G is abelian, the binary operation is de- 
noted by + and G is said to be written additively. In such case the identity element is 
denoted by 0 and is called the zero element; and the inverse is denoted by —a and is called 
the negative of a. 

If A and B are subsets of a group G then we write 

AB = {ab : a G A, b G B}, or A + B - {a + b : a G A, b G B) 
We also write a for {a}. 

A subset H of a group G is called a subgroup of G if H itself forms a group under the 
operation of G. If H is a subgroup of G and aGG, then the set Ha is called a right coset 
of H and the set aH is called a left coset of H. 

Definition: A subgroup H of G is called a normal subgroup if a^HacH for every aGG. 

Equivalently, H is normal if aH = ffa for every aGG, i.e. if the right and 
left cosets of H coincide. 

Note that every subgroup of an abelian group is normal. 

Theorem B.1: Let H be a normal subgroup of G. Then the cosets of H in G form a group 
under coset multiplication. This group is called the quotient group and is 
denoted by G/H. 

Example B.1 : The set Z of integers forms an abelian group under addition. (We remark that the 
even integers form a subgroup of Z but the odd integers do not.) Let H denote the 
set of multiples of 5, i.e. H = {. . .,—10, -5, 0, 5, 10, . . .}. Then H is a subgroup 
(necessarily normal) of Z. The cosets of H in Z follow: 



320 



APPENDIX BJ 



ALGEBRAIC STRUCTURES 



321 



0 = 0 + H 

1 = l + H 

2 = 2 + H = { 

3 = S + H = { 

4 = 4 + £T = { 



H = {...,-10,-5,0,5,10, 
{...,-9, -4, 1, 6, 11, ...} 

• •} 

• •} 
■ •} 



. , 8, 3, 2, 7, 12, 
•f 7, 2, 3, 8, 13, 
.,-6, -1,4, 9, 14, 



For any other integer n G Z, m = w + H coincides with one of the above cosets. 
Thus by the above theorem, Z/H = (0, 1, 2, 3, 4} forms a group under coset addition; 
its addition table follows: 



+ 


0 


1 


2 


3 


4 


0 


0 


1 


2 


3 


4 


1 


1 


2 


3 


4 


0 


2 


2 


3 


4 


0 


1 


3 


3 


4 


0 


1 


2 


4 


1 


0 


T 


2 


3 



Example B.2: 



This quotient group Z/H is referred to as the integers modulo 5 and is frequently 
denoted by Z 5 . Analogously, for any positive integer n, there exists the quotient 
group Z n called the integers modulo n. 

The permutations of n symbols (see page 171) form a group under composition of 
mappings; it is called the symmetric group of degree n and is denoted by S n . We 
investigate S 3 here; its elements are 



■a 
■a 



!) 



" 2 = (3 
" = (2 



* = (2 
= (J 



0 



3 

? * 

tiplication table of S 3 is 



/* 2 3 \ . ,. 

Here f ^ fc y 18 e Permutation which maps 1 ■-» i, 2 t-» 3 (-» fc. The mul- 





« 




(T 2 


"3 


01 


02 


e 


e 


"1 


"2 


"3 


01 


02 


"1 


ff » 


f 


01 


02 


"2 


"3 


<r 2 


"2 


02 


e 


01 


"3 


"1 


»3 


»3 


01 


02 


c 


"1 


"2 


01 


01 


»3 


"1 


"2 


02 


e 


02 


02 


»2 


"3 


"\ 


e 


01 



(The element in the ath row and 6th column is ab.) The set H = {e, is a sub- 
group of S 3 ; its right and left cosets are 

Right Cosets Left Cosets 

H = {,,„,} H = 

#01 = {01. "2) 01# — {01, "3} 

#02 = {02> ff 3} 02# = {02.0-2} 

Observe that the right cosets and the left cosets are distinct; hence H is not a normal 
subgroup of S3. 

A mapping / from a group G into a group G' is called a homomorphism if /(a&) = 
/(a)/(b) for every a, & G G. (If / is also bijective, i.e. one-to-one and onto, then / is called 
an isomorphism and G and G' are said to be isomorphic.) If f:G-*G' is a homomorphism, 
then the feerraei of / is the set of elements of G which map into the identity element e' € G': 

kernel of / = {o G G : f(a) = e'} 

(As usual, f{G) is called the iwaflre of the mapping /: G->G'.) The following theorem 
applies. 

Theorem B.2: Let f:G-*G' be a homomorphism with kernel K. Then X is a normal 
subgroup of G, and the quotient group G/K is isomorphic to the image of /. 
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Example B.3: Let <? be the group of real numbers under addition, and let G' be the group of 
positive real numbers under multiplication. The mapping / : G -> G' denned by 
/(a) — 2 a is a homomorphism because 

f(a+b) = 2° + " = 2«2*> = f{a)f(b) 

In particular, / is bijective; hence G and G' are isomorphic. 

Example B.4: Let G be the group of nonzero complex numbers under multiplication, and let G' 
be the group of nonzero real numbers under multiplication. The mapping / : G -» G' 
defined by f(z) = \z\ is a homomorphism because 

/(«l2 2 ) = 1*1*21 = Nil W\ = /( 2 l) f("2> 

The kernel K of / consists of those complex numbers z on the unit circle, i.e. for 
which |z| = 1. Thus G/K is isomorphic to the image of /, i.e. to the group of positive 
real numbers under multiplication. 

RINGS, INTEGRAL DOMAINS AND FIELDS 

Let R be a nonempty set with two binary operations, an operation of addition (denoted 
by +) and an operation of multiplication (denoted by juxtaposition). Then R is called a 
ring if the following axioms are satisfied: 
[Ri] For any a,b,c G R, we have (a + b) + c = a + (b + c). 

[R2] There exists an element 0 G R, called the zero element, such that a + 0 = 0 + a = a 
for every a G R. 

[R 3 ] For each aGR there exists an element —a G R, called the negative of a, such that 
o + (—a) = (—a) + a = 0. 

[Ri\ For any a,b &R, we have a + b = & + a. 

[# 5 ] For any a,b,c€ R, we have (a&)c = a(bc). 

[Re] For any a, 6, c G R, we have: 

(i) a(& + c) = ab + ac, and (ii) (& + e)a =ba + ca. 

Observe that the axioms [Ri] through [R4] may be summarized by saying that R is an 
abelian group under addition. 

Subtraction is denned in R by a — b = a + (—&). 

It. can be shown (see Problem B.25) that a • 0 = 0 • a — 0 for every a G R. 

R is called a commutative ring if 06 = ba for every a, b G R. We also say that R is 
a ring with a unit element if there exists a nonzero element 1 G R such that a - 1 = 1 • a = a 
for every a G R. 

A nonempty subset S of R is called a subring of R if S itself forms a ring under the 
operations of R. We note that S is a subring of /? if and only if a, b G S implies a — b G S 
and ab G S. 

A nonempty subset / of R is called a Ze/t ideai in R if : (i) a — & G / whenever a, 6 G /, 
and (ii) ra G / whenever r e R, a G I. Note that a left ideal / in R is also a subring of R. 
Similarly we can define a right ideal and a two-sided ideal. Clearly all ideals in com- 
mutative rings are two-sided. The term ideal shall mean two-sided ideal unless otherwise 
specified. 

Theorem B.3: Let / be a (two-sided) ideal in a ring R. Then the cosets {a + I: a G R} 
form a ring under coset addition and coset multiplication. This ring is 
denoted by R/I and is called the quotient ring. 

Now let R be a commutative ring with a unit element. For any a G R, the set 
(a) = {ra : r G R} is an ideal; it is called the principal ideal generated by a. If every ideal 
in R is a principal ideal, then R is called a principal ideal ring. 

Definition: A commutative ring R with a unit element is called an integral domain if R 
has no zero divisors, i.e. if ab — 0 implies a = 0 or b — 0. 
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Definition: A commutative ring R with a unit element is called a field if every nonzero 
a G R has a multiplicative inverse, i.e. there exists an element a -1 E R such 
that aa _1 = a -1 a - 1. 

A field is necessarily an integral domain; for if ab = 0 and a ¥■ 0, then 

b = 1-b = a _1 ab = a -1 *0 = 0 

We remark that a field may also be viewed as a commutative ring in which the nonzero 
elements form a group under multiplication. 

Example B.5 : The set Z of integers with the usual operations of addition and multiplication is the 
classical example of an integral domain with a unit element. Every ideal I in Z is 
a principal ideal, i.e. I = (n) for some integer n. The quotient ring Z„ = Z/(n) 
is called the ring of integers modulo n. If n is prime, then Z n is a field. On the 
other hand, if n is not prime then Z n has zero divisors. For example, in the ring Z 6 , 
2 3=0 and 2^0 and 3 # 0. 

Example B.6: The rational numbers Q and the real numbers R each form a field with respect 
to the usual operations of addition and multiplication. 

Example B.7: Let C denote the set of ordered pairs of real numbers with addition and multiplica- 
tion defined by 

(a, 6) + (c, d) = (a+c,b + d) 

(a, 6) • (e, d) = (ac —bd, ad + be) 

Then C satisfies all the required properties of a field. In fact, C is just the field of 
complex numbers (see page 4). 

Example B.8: The set M of all 2 by 2 matrices with real entries forms a noncommutative ring with 
zero divisors under the operations of matrix addition and matrix multiplication. 

Example B.9: Let R be any ring. Then the set R[x] of all polynomials over R forms a ring with 
respect to the usual operations of addition and multiplication of polynomials. 
Moreover, if R is an integral domain then R[x] is also an integral domain. 

Now let D be an integral domain. We say that b divides a in D if a = bc for some 
c G D. An element u G D is called a unit if u divides 1, i.e. if u has a multiplicative inverse. 
An element befl is called an associate of a G D if b = ua for some unit u G D. A 
nonunit p G D is said to be irreducible if p = ab implies a or & is a unit. 

An integral domain D is called a unique factorization domain if every nonunit a G D 
can be written uniquely (up to associates and order) as a product of irreducible elements. 

Example BJO: The ring Z of integers is the classical example of a unique factorization domain. 

The units of Z are 1 and —1. The only associates of n e Z are n and — n. The 
irreducible elements of Z are the prime numbers. 

Example B.ll : The set D = {a+ by/i3 : a, b integers} is an integral domain. The units of D 
are ±1, 18 ± 5\/l3 and -18 ± 5V13. The elements 2, 3 - y/TJ and -3 - Vl3 are 
irreducible in D. Observe that 4 = 2 • 2 = (3 — Vl3 )(— 3 - Vl3 ). Thus D is not 
a unique factorization domain. (See Problem B.40.) 



MODULES 

Let M be a nonempty set and let R be a ring with a unit element. Then M is said to be a 
(left) R-module if M is an additive abelian group and there exists a mapping RxM-* M 
which satisfies the following axioms: 
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[Mi] r(mi + m 2 ) - rm x + rm 2 
[M 2 ] (r + s)m = rm + sm 
[M s ] (rs)m = r(sm) 
[M 4 ] I'm — m 

for any r,s G R and any m s G M. 

We emphasize that an JR-module is a generalization of a vector space where we allow 
the scalars to come from a ring rather than a field. 

Example B.12: Let G be any additive abelian group. We make G into a module over the ring Z of 
integers by denning 

n times 



ng = g + g + • • ■ + g, Og = 0, (~n)g = -ng 
where n is any positive integer. 

Example B.13: Let R be a ring and let / be an ideal in R. Then / may be viewed as a module over R. 

Example B.14: Let V be a vector space over a field K and let T : V -* V be a linear mapping. 

We make V into a module over the ring K[x] of polynomials over K by defining 
f(x)v = f(T) (v). The reader should check that a scalar multiplication has been 
defined. 

Let M be a module over R. An additive subgroup N of M is called a submodule of M 
if u G N and k G R imply ku G N. (Note that N is then a module over R.) 

Let M and M ' be /2-modules. A mapping T:M^M' is called a homomorphism (or: 
R-homomorphism or R-linear) if 

(i) + = r(tt) + T(v) and (ii) T(fc«) = fcT(w) 

for every u, v G M and every k G R. 



Problems 

GROUPS 

BJ. Determine whether each of the following systems forms a group G: 

(i) G — set of integers, operation subtraction; 

(ii) G = {1, —1}, operation multiplication; 

(iii) G = set of nonzero rational numbers, operation division; 

(iv) G = set of nonsingular n X n matrices, operation matrix multiplication; 

(v) G = {a + bi : a, b G Z}, operation addition. 

B.2. Show that in a group G: 

(i) the identity element of G is unique; 

(ii) each j£G has a unique inverse a -1 €= G; 

(iii) (o- 1 )- 1 = a, and (aft)-i = 6-i<*-i; 

(iv) a& = ac implies b = c, and &a = ca implies 6 = c. 

B.3. In a group G, the powers of a£G are defined by 

o° = e, a" = aa" -1 , o _n = (a™) -1 , where n£N 

Show that the following formulas hold for any integers r,«,t£ Z: (i) a r a s — a r+s , (ii) (a r )» = a rs , 
(iii) («>•+«)« = ort+«t. 



APPENDIX B] 



ALGEBRAIC STRUCTURES 



325 



B.4. Show that if G is an abelian group, then (oft)" = a B b» for any a, b G G and any integer n G Z. 

B.5. Suppose G is a group such that {ab) 2 = a 2 b 2 for every a, 6 G G. Show that G is abelian. 

B.6. Suppose if is a subset of a group G. Show that H is a subgroup of G if and only if (i) H is non- 
empty, and (ii) a, 6 G H implies aft -1 G fl. 

B.7. Prove that the intersection of any number of subgroups of G is also a subgroup of G. 

B.8. Show that the set of all powers of a G G is a subgroup of G; it is called the cyclic group generated 
by a. 

B.9. A group G is said to be cyclic if G is generated by some a G G, i.e. G = {a.™ : n G Z}. Show that 
every subgroup of a cyclic group is cyclic. 

B.10. Suppose G is a cyclic subgroup. Show that G is isomorphic to the set Z of integers under addition 
or to the set Z n (of the integers modulo n) under addition. 

B.I1. Let H be a subgroup of G. Show that the right (left) cosets of H partition G into mutually disjoint 
subsets. 

B.12. The order of a group G, denoted by |G|, is the number of elements of G. Prove Lagrange's theorem: 
If H is a subgroup of a finite group G, then |J?| divides \G\. 

B.13. Suppose |G| — p where p is prime. Show that G is cyclic. 

B.14. Suppose H and N are subgroups of G with N normal. Show that (i) HN is a subgroup of G and 
(ii) HnN is a normal subgroup of G. 

B.15. Let flbea subgroup of G with only two right (left) cosets. Show that H is a normal subgroup of G. 

B.16. Prove Theorem B.l: Let H be a normal subgroup of G. Then the cosets of H in G form a group 
G/H under coset multiplication. 

B.17. Suppose G is an abelian group. Show that any factor group GIH is also abelian. 

B.18. Let / : G -» G' be a group homomorphism. Show that: 

(i) /(e) = e' where e and e' are the identity elements of G and G' respectively; 

(ii) /(a- 1 ) = /(a)- 1 for any a G G. 

B.19. Prove Theorem B.2: Let / : G -» G' be a group homomorphism with kernel K. Then If is a normal 
subgroup of G, and the quotient group G/K is isomorphic to the image of /. 

B.20. Let G be the multiplicative group of complex numbers z such that \z\ = 1, and let R be the additive 
group of real numbers. Prove that G is isomorphic to R/Z. 

B.21. For a fixed g G G, let ff:G-»G be defined by g (a) = g~*ag. Show that G is an isomorphism of 
G onto G. 

B.22. Let G be the multiplicative group of n X n nonsingular matrices over R. Show that the mapping 
A l-> \A\ is a homomorphism of G into the multiplicative group of nonzero real numbers. 

B.23. Let G be an abelian group. For a fixed n G Z, show that the map a h* a" is a homomorphism 
of G into G. 

B.24. Suppose H and AT are subgroups of G with 2V normal. Prove that HnN is normal in H and 
HI (HnN) is isomorphic to HN/N. 

RINGS 

B.25. Show that in a ring R: 

(i) o • 0 = 0 • a = 0, (ii) o(-6) = (-0)6 = -06, (iii) (-a)(-6) = ab. 

B.26. Show that in a ring with a unit element: (i) (— l)a = —a, (ii) (— 1)(— 1) = 1. 
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B.27. Suppose a? = a for every a G ft. Prove that ft is a commutative ring. (Such a ring is called a 

Boolean ring.) 

B.28. Let R be a ring with a unit element. We make R into another ring ft by defining a®b = a+b + 1 
and^ a • b = ab + a + b. (i) Verify that ft is a ring, (ii) Determine the 0-element and 1-element 
of R. 

B.29. Let G be any (additive) abelian group. Define a multiplication in G by a • b = 0. Show that this 
makes G into a ring. 

B.30. Prove Theorem B.3: Let / be a (two-sided) ideal in a ring R. Then the cosets {a + 1 :. a £ R} form 
a ring under coset addition and coset multiplication. 

B.31. Let Jj and I 2 be ideals in R. Prove that I t + 1 2 and Jin/ 2 are also ideals in R. 

B.32. Let R and R' be rings. A mapping / : R -» R' is called a homomorphism (or: ring homomorphism) if 

(i) f(a +b) = f(a) + f(b) and (ii) f(ab) = f(a)f(b), 

for every a, 6 £ ft. Prove that if / : R -» ft' is a homomorphism, then the set X = {r £ ft : f(r) = 0} 
is an ideal in ft. (The set K is called the kernel of /.) 

INTEGRAL DOMAINS AND FIELDS 

B.33. Prove that in an integral domain D, if ab = ac, a ¥° 0, then b — c. 
B.34. Prove that F = {a + by/2 : a, b rational} is a field. 

B.35. Prove that D = {a + b\/2 : a, b integers} is an integral domain but not a field. 
B.36. Prove that a finite integral domain D is a field. 
B.37. Show that the only ideals in a field K are {0} and K. 

B.38. A complex number o + bi where a, b are integers is called a Gaussian integer. Show that the set G 
of Gaussian integers is an integral domain. Also show that the units in G are ±1 and ±i. 

B.39. Let D be an integral domain and let / be an ideal in D. Prove that the factor ring D/I is an integral 
domain if and only if / is a prime ideal. (An ideal / is prime if ab S I implies aG I or 6 £ /.) 

B.40. Consider the integral domain D = {a + byf\Z : a, b integers} (see Example B.ll). If a — a+ b\fl3 , 
we define N(a) = a?- 1362. p r0V e: (i) N(af3) = N(a)N(fi); (ii) a is a unit if and only if N{a) = ±1; 
(iii) the units of D are ±1, 18 ± 5^13 and -18 ± 5^13 ; (iv) the numbers 2, 3 - y/l3 and -3 - Vl3 
are irreducible. 



MODULES 

B.41. Let M be an ft-module and let A and B be submodules of M. Show that A+B and AnB are also 
submodules of M. 

B.42. Let M be an ft-module with submodule N. Show that the cosets {u + N : u G M} form an ft-module 
under coset addition and scalar multiplication defined by r(u + N) = ru + N. (This module is de- 
noted by M/N and is called the quotient module.) 

B.43. Let M and M' be ft-modules and let f : M -* M' be an ft-homomorphism. Show that the set 
K = {uGM: f(u) = 0} is a submodule of /. (The set K is called the kernel of /.) 

B.44. Let M be an ft-module and let E(M) denote the set of all ft-homomorphism of M into itself. Define 
the appropriate operations of addition and multiplication in E(M) so that E(M) becomes a ring. 
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Polynomials over a Field 

INTRODUCTION 

We will investigate polynomials over a field K and show that they have many properties 
which are analogous to properties of the integers. These results play an important role 
in obtaining canonical forms for a linear operator T on a vector space V over K. 



RING OF POLYNOMIALS 

Let K be a field. Formally, a polynomial / over K is an infinite sequence of elements 
from K in which all except a finite number of them are 0: 

/ = (. . ., 0, a», . . ., Oi, do) 

(We write the sequence so that it extends to the left instead of to the right.) The entry a* 
is called the kth coefficient of /. If n is the largest integer for which a„ ^ 0, then we say 
that the degree of / is n, written 

deg/ = n 

We also call a n the leading coefficient of /, and if o» = 1 we call / a monic polynomial. On 
the other hand, if every coefficient of / is 0 then / is called the zero polynomial, written 
/ = 0. The degree of the zero polynomial is not defined. 

Now if g is another polynomial over K, say 

g - (..., 0, &»,..., 6i, & 0 ) 

then the sum f + g is the polynomial obtained by adding corresponding coefficients. That 
is, if m — n then 

f + ff = (. . 0, a», . . ., dm + bm, . . ., ai + 6i, oo + bo) 
Furthermore, the product fg is the polynomial 

fg = ( . . . , 0, dn&m, . . - , fflibo + eto&i, ao&o) 
that is, the fcth coefficient c k of fg is 

k 

Ck = 2 Oi&k-i = a <>bk + ai6k-i + • • • + dkbo 

i=0 

The following theorem applies. 

Theorem C.l: The set P of polynomials over a field K under the above operations of addi- 
tion and multiplication forms a commutative ring with a unit element 
and with no zero divisors, i.e. an integral domain. If / and g are nonzero 
polynomials in P, then deg {fg) = (deg /)(deg g). 
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NOTATION 

We identify the scalar a 0 G K with the polynomial 

Oo = ( . . . , 0, ao) 
We also choose a symbol, say t, to denote the polynomial 

* = (...,0,1,0) 

We call the symbol t an indeterminant. Multiplying t with itself, we obtain 

t 2 = (...,0,1,0,0), i 3 = (...,0,1,0,0, 0), ... 

Thus the above polynomial / can be written uniquely in the usual form 

/ = a n t n + • • • + ait + do 

When the symbol t is selected as the indeterminant, the ring of polynomials over K is 
denoted by 

and a polynomial / is frequently denoted by f(t). 

We also view the field K as a subset of K[t] under the above identification. This is pos- 
sible since the operations of addition and multiplication of elements of K are preserved 
under this identification: 

(...,0, oo) + (..., 0, 6o) = (...,0, ao + bo) 
(..., 0, oo) •(..., 0, 6o) = (...,0, aob 0 ) 

We remark that the nonzero elements of K are the units of the ring K[t). 

We also remark that every nonzero polynomial is an associate of a unique monic poly- 
nomial. Hence if d and d' are monic polynomials for which d divides d' and d' divides d, 
then d = d'. (A polynomial g divides a polynomial / if there is a polynomial h such that 
/ = kg.) 

DIVISIBILITY 

The following theorem formalizes the process known as "long division". 

Theorem C.2 (Division Algorithm) : Let / and g be polynomials over a field K with g ¥* 0. 

Then there exist polynomials q and r such that 

f = qg + r 

where either r = 0 or deg r < deg g. 

Proof: If f — 0 or if deg / < deg g, then we have the required representation 

f = Og + f 

Now suppose deg / — deg g, say 

/ = a n t n + • • • + ait + ao and g = b m t m + • ■ • + bit + b 0 
where a*, b m ¥■ 0 and n — m. We form the polynomial 

U = f-^V-ng (1) 

Om 

Then deg fi < deg /. By induction, there exist polynomials qi and r such that 

fi = Qig + r 
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where either r = 0 or degr < deg g. Substituting this into (1) and solving for /, 

/ = (qi +^t»- m ^g + r 
which is the desired representation. 

Theorem C.3: The ring K[t) of polynomials over a field K is a principal ideal ring. If I is 
an ideal in K[t], then there exists a unique monic polynomial d which gen- 
erates 7, that is, such that d divides every polynomial / G I. 

Proof. Let d be a polynomial of lowest degree in I. Since we can multiply d by a non- 
zero scalar and still remain in I, we can assume without loss in generality that d is a monic 
polynomial. Now suppose / G I. By Theorem C.2 there exist polynomials q and r such that 

/ = qd + r where either r — 0 or deg r < deg d 

Now f,d G I implies qd G I and hence r = f — qd G I. But d is a polynomial of lowest 
degree in I. Accordingly, r = 0 and / = qd, that is, d divides /. It remains to show that 
d is unique. If d' is another monic polynomial which generates I, then d divides d' and d' 
divides d. This implies that d = d', because d and d' are monic. Thus the theorem is 
proved. 

Theorem C.4: Let / and g be nonzero polynomials in K[t]. Then there exists a unique 
monic polynomial d such that: (i) d divides / and g; and (ii) if d' divides 
/ and g, then d' divides d. 

Definition: The above polynomial d is called the greatest common divisor of / and fir. If 
d — 1, then / and g are said to be relatively prime. 

Proof of Theorem C4. The set 7= {mf + ng: m,nGK[t]} is an ideal. Let d be the 
monic polynomial which generates I. Note f,g € I; hence d divides / and g. Now suppose 
d' divides / and g. Let J be the ideal generated by d'. Then f,g GJ and hence Icj. 
Accordingly, d G J and so d' divides d as claimed. It remains to show that d is unique. 
If di is another (monic) greatest common divisor of / and g, then d divides di and di divides 
d. This implies that d — di because d and di are monic. Thus the theorem is proved. 

Corollary C.5: Let d be the greatest common divisor of the polynomials / and g. Then 
there exist polynomials m and n such that d — mf + ng. In particular, if 
/ and fir are relatively prime then there exist polynomials m and n such 
that mf + ng = 1. 

The corollary follows directly from the fact that d generates the ideal 

I = {mf + ng: m,nGK[t]} 

FACTORIZATION 

A polynomial p G K[t] of positive degree is said to be irreducible if p — fg implies 
/ or g is a scalar. 

Lemma C.6: Suppose p G K[t] is irreducible. If p divides the product fg of polynomials 
f,g G K[t), then p divides / or p divides g. More generally, if p divides the 
product of n polynomials /1/2. . ./*, then p divides one of them. 

Proof. Suppose p divides fg but not /. Since p is irreducible, the polynomials / and 
p must then be relatively prime. Thus there exist polynomials m,nG K[t] such that 
mf + np — 1. Multiplying this equation by g, we obtain mfg + npg = g. But p divides fg 
and so mfg, and p divides npg; hence p divides the sum g = mfg + npg. 
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Now suppose p divides /1/2. . ./«. If p divides f u then we are through. If not, then by 
the above result p divides the product / 2 . . ./». By induction on n, p divides one of the poly- 
nomials U, . . . , f n . Thus the lemma is proved. 

Theorem C.7 (Unique Factorization Theorem): Let / be a nonzero polynomial in K[t], 
Then / can be written uniquely (except for order) as a product 

/ = &P1P2. . .Pn 

where k G K and the Pi are monic irreducible polynomials in K[t]. 

Proof: We prove the existence of such a product first. If / is irreducible or if f G K, 
then such a product clearly exists. On the other hand, suppose / = gh where / and g are 
nonscalars. Then g and h have degrees less than that of /. By induction, we can assume 

g — kigig* . . . g r and h — kzhihz . . . h s 

where hi, fcGX and the g { and hj are monic irreducible polynomials. Accordingly, 

/ = (kiki)gig2 . . . g r hih2 . . . h a 

is our desired representation. 

We next prove uniqueness (except for order) of such a product for /. Suppose 

/ = kpip 2 ...p„ = k'qiq 2 ...q m 

where k,k' GK and the pi, . . . , p n , g u . . . , q m are monic irreducible polynomials. Now pi 
divides k'qi . . . q m . Since pi is irreducible it must divide one of the qi by the above lemma. 
Say pi divides qi. Since pi and qi are both irreducible and monic, pi = qi. Accordingly, 

kp2. . .p n = k'qi. . .q m 

By induction, we have that n = m and P2 = qi, . . . , p% — q m for some rearrangement of 
the qi. We also have that k = k'. Thus the theorem is proved. 

If the field K is the complex field C, then we have the following result which is known 
as the fundamental theorem of algebra; its proof lies beyond the scope of this text. 

Theorem C.8 (Fundamental Theorem of Algebra): Let f{t) be a nonzero polynomial 
over the complex field C. Then /(*) can be written uniquely (except for 
order) as a product 

f(t) = k(t-ri)(t-r 2 )>--(t-r n ) 
where k, n G C, i.e. as a product of linear polynomials. 

In the case of the real field B we have the following result. 

Theorem C.9: Let f(t) be a nonzero polynomial over the real field R. Then f(t) can be 
written uniquely (except for order) as a product 

f(t) = kpi{t)p2{t) -" Pm (t) 

where k G R and the pi{t) are monic irreducible polynomials of degree one 
or two. 
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Inverse, 

mapping, 123 

matrix, 44, 176 
Invertible, 

linear operator, 130 

matrix, 44 
Irreducible, 323, 329 
Isomorphism of 

algebras, 169 

groups, 321 

inner product spaces, 286, 311 
vector spaces, 93, 124 

Jordan canonical form, 226 

Kernel, 123, 321, 326 

2 2 -space, 280 

Line segment, 14, 260 

Linear combination 

of equations, 30 

of vectors, 66 
Linear dependence, 86 

in R", 28 

Linear equations, 18, 127, 176, 251, 282 
Linear functional, 249 
Linear independence, 86 

in R", 28 
Linear mapping, 123 

matrix of, 150 

rank of, 126 
Linear operators, 129 
Linear span, 66 

Mapping, 121 

linear, 123 
Matrices, 35 

addition, 36 

augmented, 40 

block, 45 

change of basis, 153 
coefficient, 40 
column, 35 
congruent, 262 
determinant, 171 
diagonal, 43 
echelon, 41 
equivalent, 61 
Hermitian, 266 
identity, 43 
multiplication, 39 
normal, 290 
rank, 90 
row, 35 

row canonical form, 42, 68 
row equivalent, 41 
row space, 60 
scalar, 43 

scalar multiplication, 36 
similar, 155 
size, 35 
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Matrices (cont.) 

square, 43 

symmetric, 65, 288 

transition, 153 

transpose, 39 

triangular, 43 

zero, 37 
Matrix representation, 

bilinear forms, 262 

linear mappings, 150 
Maximal independent set, 89 
Minimal polynomial, 202, 212 
Minkowski's inequality, 10 
Minor, 174 
Module, 323 
Monic polynomial, 201 
Multilinear, 178, 277 
Multiplication of matrices, 37, 39 

N (positive integers), 315 
it-space, 2 
n-tuple, 2 
Nilpotent, 225 

Nonnegative semi-definite, 265 
Nonsingular, 

linear mapping, 127 

matrix, 130 
Norm, 279 

in R", 4 

Normal operator, 286, 290, 303 
Normal subgroup, 320 
Normalized vector, 280 
Null set, 315 
Nullity, 126 

Odd, 

function, 73 

permutation, 171 
One-to-one mappings, 123 
Onto mappings, 123 
Operations with linear mappings, 128 
Operators (see Linear operators) 
Ordered pair, 318 
Orthogonal 

complement, 281 

matrix, 287 

operator, 286 

vectors, 3, 280 
Orthogonally equivalent, 288 
Orthonormal, 282 

Parallelogram law, 307 
Parity, 171 
Partition, 319 
Permutations, 171 
Polar form, 264, 307 
Polynomials, 327 
Positive 

matrix, 310 

operator, 288 
Positive definite, 

bilinear form, 265 

matrix, 272, 310 

operator, 288 



Primary decomposition theorem, 225 

Prime ideal, 326 

Principal ideal, 322 

Principal minor, 219 

Product set, 317 

Projection operator, 243, 308 

orthogonal, 281 
Proper 

subset, 316 

value, 198 

vector, 198 

Q (rational numbers), 315 
Quadratic form, 264 
Quotient, 

group, 320 

module, 326 

ring, 322 

set, 319 

space, 229 

R (real field), 315 

R» 2 

Rank, 

bilinear form, 262 

linear mapping, 126 

matrix, 90, 195 
Rational canonical form, 228 
Relation, 318 
Relatively prime, 329 
Ring, 322 
Row, 

canonical form, 42 

equivalent matrices, 41 

of a matrix, 35 

operations, 41 

rank, 90 

reduced echelon form, 41 
reduction, 42 
vector, 36 

Scalar, 2, 63 

mapping, 219 

matrix, 43 
Scalar multiplication, 69 

of linear mappings, 128 

of matrices, 36 
Second dual space, 251 
Self -adjoint operator, 285 
Set, 315 
Sgn, 171 

Sign of a permutation, 171 
Signature, 265, 266 
Similar matrices, 155 
Singular mappings, 127 
Size of a matrix, 35 
Skew-adjoint operator, 285 
Skew-symmatric bilinear form, 263 
Solution, 

of linear equations, 18, 23 

space, 65 
Span, 66 

Spectral theorem, 291 
Square matrices, 43 
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Subgroup, 320 
Subring, 322 
Subset, 315 

Subspace (of a vector space), 65 

sum of, 68 
Surjective mapping, 123 
Sylvester's theorem, 265 
Symmetric, 

bilinear form, 263 

matrix, 65 

operator, 285, 288, 300 
System of linear equations, 19 

Trace, 155 

Transition matrix, 153 
Transpose, 

of a linear mapping, 252 

of a matrix, 39 
Transposition, 172 
Triangle inequality, 293 
Triangular, 

form, 222 

matrix, 43 
Trivial solution, 19 

Union of sets, 316 



Unique factorization, 323 
Unit vector, 280 
Unitarily equivalent, 288 
Unitary, 

matrix, 287 

operator, 286 

space, 279 
Universal set, 316 
Upper triangular matrix, 43 
Usual basis, 88, 89 

Vector, 63 

in C n , 5 

in R», 2 
Vector space, 63 
Venn diagram, 316 

Z (integers), 315 

Z n (ring of integers modulo n), 323 
Zero, 

mapping, 124 

matrix, 37 

of a polynomial, 44 

solution, 19 

vector, 3, 63 
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